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Introduction 



Quantum computing is concerned with examining how the laws of quantum mechanics can 
be used to construct computation devices that would be possibly more powerful than the 
"classical ones", represented by Turing machines. As was pointed out in [2], computational 
devices behaving according to the quantum mechanics are not captured by the classical 
Church- Turing thesis stating that any practically realizable computational device can be 
simulated by a Turing machine. The reason is that the model of Turing machines is built 
implicitly assuming the laws of classical (e.g. nonquantum) physics hold, through it may seem 
that in the model there is no physics involved. 

Thus, if we believe that the quantum theory is correct, we have to generalize the thesis to: 
" Any practically realizable computational device can be simulated by a quantum mechanical 
device." This automatically contains the classical Church- Turing thesis, since huge enough 
quantum systems may be described by the laws of classical physics with practically infinite 
precision. This generalized thesis gave rise to interest in quantum computing. 

This research field entered much into fashion mainly thanks to Peter W. Shor, who pub- 
lished in 1994 his algorithm that can efficiently factor a given number Here the word 
efficiently means in polynomial time with respect to the length of the input. It's exponentially 
faster than any other known algorithm for a deterministic Turing machine. The problem of 
finding a factor of a given number is the core of many cryptographic systems, where it is 
assumed that to factor a large number is practically impossible in a reasonable time. It is 
believed to be a one-way problem: Multiplying up few numbers is very easy and can be done 
efficiently, while any known algorithm for a Turing machine requires exponential time. 

After the algorithm, nowadays very famous, appeared, there has been a big rush for other 
algorithms showing an exponential speedup over classical algorithms. Surprisingly, until now, 
no other quantum algorithm 1 in the nonrclativized (eg. without oracles) setting that would 
offer such a speedup is known. Furthermore, there is not any physical realization known, 
that would allow to build computers with an unbounded amount of quantum bits without an 
exponential slowdown. In any case, before any serious attempt to build a quantum computer 
the question "How useful could a quantum computer be?" must be answered. 

The class BQP that contains all problems efficiently computable on a quantum computer 
should be well understood and placed into to the right place between other well-known 
traditional complexity classes like NP, BPP, etc. However nowadays, even fundamental 
questions like "Does BQP contain NP-complete problems?" and "Does BQP belong to 
the polynomial hierarchy?" still remain unanswered. Nor the lower bounds nor the upper 
bounds of the complexity of the class BQP are very tight. A possible reason can be that 
finding answers to such fundamental questions about the class BQP would immediately 
answer other fundamental questions from classical complexity theory that are considered to 
be very hard. For example, showing BPP ^ BQP would immediately imply that P ^= 
PSPACE. Another reason might be that, through exactly formalized it is, the class BQP 
is somehow counterintuitive and hard to work with. Intuitively, it would be easier to find 
relations between two quantum classes than between one conventional and one quantum 
class. A possible approach to go round these problems and still provide a better insight to 
the complexity of the class BQP is to define a new bunch of classes somehow inspired by the 



or more precisely an algorithm based on a different idea which is here the quantum Fourier transform 
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BQP class, place them to the conventional class hierarchy and then show some interesting 
properties of them together with their relation to BQP. One of such bunches is suggested 
here. 

The aim of this paper is to first review the model of quantum computation, both from 
the view of Quantum Turing machines and the view of Quantum circuits. The relation 
between those two is sketched and also the phenomenon of entanglement and its importance 
are discussed. The main work comes after that. We introduce new quantum complexity 
classes related to the BQP class and show their properties. 

This thesis is organized as follows: In chapter [21 we define the quantum Turing machine 
and show some of its properties. In the next chapter, we introduce quantum circuits. Chapter 
0]shows the relations between quantum circuits and quantum Turing machines. We focus on 
comparing the power of the two models. The main results of this work show up in chapter 
Therein, we define new complexity classes and show some interesting properties of them. 
The new hierarchy of classes is illustrated in section lBTfl Figure l5"?l together with some other 
well-known complexity classes. 



In the text, we assume that the reader is familiar with basic linear algebra and with basic 
notion of complexity classes and Turing machines. We do not assume any kind of physics 
knowledge. Those, who are interested in the notion of Quantum Computing from a computer 
scientist point of view, can start from the beginning with chapter [21 Those, who are already 
familiar with Quantum Computing and are only interested in the new bunch of classes, can 
refer to chapter 



Chapter 1 

Preliminaries 



In this chapter we summarize definitions and theorems known in linear algebra that are used 
in this text. 

1.1 Hilbert space 

We will need these spaces to be able work with quantum gates and quantum states. A Hilbert 
space, through its name may suggest otherwise, is a natural notion of a space that has all 
the nice properties we imagine and many times take for obvious. We need to have an inner 
product defined in order to measure angles, to have a norm defined in order to choose only 
unit norm vectors and we need that space be complete, because we do not want to slip out 
of it when doing a limit. We formalize these properties bellow. 

Definition 1.1 (Inner product). An inner product over a vector space V is a complex 
function of two arguments denoted by ■ and satisfying 

1. Vx e V : x ■ x > 

2. x ■ x = x = 6 

3. Vx, y, z € V : (ax + by) ■ z = a(x ■ z) + b(y ■ z) 

4. Vx,yEV:x-y=(yx)* 

Each inner product induces a norm \x\ given by \x\ = yjx ■ x 

Definition 1.2 (Inner product space). An inner product space is a vector space where 
an inner product ■ is defined. 

Definition 1.3 (Perpendicular vectors). If for two vectors x,y e V for some inner 
product space V, it holds that x ■ y = 0, we say that x is perpendicular to y and write x _L y. 

Definition 1.4 (Complete space). A space H with a defined norm \x\ is complete under 
that norm iff for each sequence {xi}'^ 1 such that X{ € H and Ymin^^^Xn — x m \ — 0, it 
holds that lim^oo x n — x e H. 

Definition 1.5 (Hilbert space). A Hilbert space is an inner product space that is complete 
under the induced norm. 

Definition 1.6 (Dual space). A dual space to an inner product vector space V is a space 
V D spanned by linear functionals x D : V — > C given by x D (y) = x ■ y. The space V D is also 
an inner product space, with an inner product defined as (x D ■ y D )(z) °= r (x ■ y). 
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Dirac notation Wc write \x), the so called "ket- vector" to emphasize that x is an element 
of a Hilbert Space. (x|, so called "bra", is element of dual Hilbert space. The inner product 
of x and y is symbolically written as (x\\y), which is usually shortened to (x\y). If we take 
a linear operator O and apply it onto a ket |x), we get a new vector \Ox) which we denote 
as 0\x). We can then write the inner product of y and Ox as (y\Ox) = (y\0\x). If both 
x and y are base vectors, then the expression (y\0\x) is called a "matrix element" of the 
operator O, and also denoted as O x . v . Having specified all its matrix elements, we have 
specified the linear operator itself. Thus, in the text, we will use the terms "operator" and 
"matrix" interchangeably, using the fact that we talk only about linear operators, since only 
these occur in quantum mechanics. 

Definition 1.7 (Superposition). Let us have a Hilbert space H and its base {f =1 |ej)}. 
Then each vector v from H can be written as v — aj|ej). If more than one coefficient 

di is nonzero, we say that v is a superposition of corresponding e{ 's. Often, we use the word 
"superposition" to indicate that v is not a base vector. 

1.2 Tensor product 

Definition 1.8 (Tensor product of two Hilbert spaces). A tensor product of two Hilbert 
spaces U and V is a vector space W, denoted asW — U®V, spanned by all possible pairs 
of vectors 

{u® v\u eU, v eV} 

Its base Wb is 

W b = {u <g> v\u eU b , v e V b } 

where Ub,Vb are bases of U and V respectively. The space W is again a Hilbert space, with 
an inner product ■ defined as 

a ® b ■ c <g> d = (a ■ c)(b ■ d) 

This also automatically defines a tensor product of two vectors and a tensor product of two 
linear operators. 

Many times in this text, we will omit the x sign and write \ab) instead of \a) ® |6) where 
no confusion may occur. We illustrate the notion in the following example: 

Example 1.1 (Tensor product). Let us have two Hilbert spaces U and V with bases 

t4 = {|0),|l» 
H = {|0'>,|1'» 

Then the space W = U (&V is spanned by base 

W 6 = {|00'),|01 , ),|10 , ),|H'>} 
If we have vectors u G U, v G V defined as 

|u)=o|0)+6|l) 
|«)=c|0')+d|l') 

the vector w = u ® v <G U <S>V reads 

w = oc|00') + ad|0l') + 6c|10') + 6d|ll') 
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If we have two linear operators A and B defined on spaces U and V respectively, with 
their matrix representations Aij and Bij, their product C — A® B working on space U <£> V 
has matrix B^^l) = MkBji- For example, if 



and 



then 
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1.3 Qubits and gates 

Quantum mechanics tells us that a state of any quantum system is always described by a 
unit norm ket vector. Thus, all vectors we will work with will have a unit norm: 

Vx : (x\x) = 1 

Definition 1.9 (Qubit). A qubit is a unit norm element of a Hilbert space of dimension 2. 
If we label the base vectors |0) and |1), a qubit is of the form a|0)+/3|l) where \a\ 2 + \f3\ 2 = 1. 

Definition 1.10 (Hermitian conjugate). A hermitian conjugate of an operator O is such 
an operator for which it holds: 

Vx,y:{x\Oy) = (0^x\y) 

Definition 1.11 (Inverse operator). An operator O^ 1 is inverse operator of operator O 
iff 

Vx : \x) = lO^Ox) = lOO^x) 
Definition 1.12 (6 matrix). The S matrix is defined as 

.i,. {' ' /; 

I otherwise 

Many times, we will write the 8 matrix simply as 1. 

Definition 1.13 (Separable matrix). A matrix M is separable if it can be written as 
M = A® B for A and B of dimension at least 2. 

Quantum mechanics also tells us, that the time evolution of a quantum system is unitary. 
Thus, in this text, we will encounter only unitary operators. 

Definition 1.14 (Unitary operator). An operator O is unitary iff = O -1 . 

Observation 1.1. Unitary operators preserve the inner product. 
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Proof. We have 

Vx,y : (Ux\Uy) = (rfUx\y) = (U^Uxly) = (x\y) 

□ 

Lemma 1.2 (Conditions for unitarity). A matrix U is unitary iff all its columns and 
rows have a unit norm and are mutually orthogonal. Formally: 

52(k\U\ yi )*(k\U\y 2 )=6 Vim 

k 

Proof. We need to show that: UW = WU = 1. We have 

<W - 5>|E/|yi)*<fc|E/|y 2 > = 5>i|t/t|fc><fc|£/|y 2 ) = ( yi \U^U\y 2 ) 

k k 

and thus WU — 1. This leads to 

UU 1 = (UU^)UU- 1 = U{U^U)U~ l = UU' 1 = 1 

□ 



Chapter 2 

Quantum Turing machine 



In this chapter, we will treat the quantum Turing machine and compare its properties with 
those of classical Turing machines. First, we will review the classical Turing machine and its 
basics properties. Then, we will jump to its generalized version: the PTM. At last, we will 
define a quantum Turing machine in an analogous way and point out at the differences to 
the other types of Turing machines. 

2.1 Classical Turing machines 

The Turing machine serves as a model of a general computational device. As the Church- 
Turing thesis says, anything a realizable physical device can compute, the Turing machine 
can compute. The basic one, classical deterministic Turing machine is the simplest of all 
Turing machines. It behaves according to well defined strict rules, which is also what our 
conventional computers do: They process an exact program stored in a memory, step by 
step. On the other side, quantum computers, if ever built, will be able to do much more 
things. This obviously means that also the model of such computers, the quantum Turing 
machine will be somehow different. To show the logic of its definition, we will first start with 
the classical Turing machine and modify it step by step to finally arrive at the definition of 
the quantum Turing machine at the end of this chapter. 

2.1.1 Classical deterministic Turing machine 

A Turing machine is a device that moves along a possibly infinite tape. Is is equipped with a 
head that can read symbols from the tape and write symbols to the tape. The set of possible 
symbols is finite. The tape is divided into cells and the head can move to the neighbor cells 
in both directions (left or right) in a single step. The head also has a finite set of states. The 
current state of the head and the symbol read from the current position determine the state 
the head will turn to, the symbol it will write to the tape and the direction it will move to. 
The behavior is encoded in a transition function 5: 

8(a,q) = (a',q',D) 

where 

a is the symbol read from the current position on the tape 
q is the state the head is in 

a' is the symbol that will be written to the current position on the tape 



q' is the state the head will turn to 
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D is the direction the head will move to, its either D = L standing for "left" or D = R 
standing for "right" 

Since we have only a finite set of possible symbols and a finite set of possible states, the 
definition of the 8 function is also finite. The only infinite thing is the tape. That corresponds 
to our experience: We want to have a device that can handle inputs of possibly unbounded 
size without a need to alter the device in any way. 

Initially, the input is written on the tape and the head is in its special initial state. Once 
the task is done, the machine stops. This happens when the 5 function is not defined on the 
corresponding symbol and state. Formally, the Turing machine is defined as follows: 

Definition 2.1 (Turing machine). A Turing machine (TM) is an ordered seven-tuple 
(E,A,Q,qi,A,F,S), where 

E is a finite set, called "alphabet", of all possible tape symbols. 

A £E is an identified symbol called a "blank symbol" 

Q is a finite set, called "set of states" 

q% € Q is the initial state 

A C Q is a set of accepting states 

F C Q is a set of final states 

i:SxQ^ExQx {L, R} is a transition function. 
It operates on a tape of cells indexed by Z. A Turing machine internally stores: 

• Qcurrent € Q, initialized to q t 

• integer i, initialized to 0. i is the position on the tape which will be read in the next 
step. 

A computation consists of periodical repeating of steps that are determined by the 6 function. 
A Turing machine is said to halt when it reaches a state from F. If it halts in some state in 
F n A, it is said to accept. 

Usually, a Turing machine has various tapes, not only one. However, its known that each 
multi-tape Turing machine can be simulated by a single-tape Turing machine with no more 
than a polynomial slowdown. Therefore, we restrict ourselves to single-tape Turing machines. 

A Turing machine can be treated either as an acceptor or as a transducer. An acceptor 
is a machine that does not write any output onto the tape, the only information it tells us 
is the state it halted in. A transducer, in a contrary, writes the output onto the tape, while 
the state it halts in may be arbitrary. Formally: 

Definition 2.2 (Transducer). A transducer is a Turing machine that uses its tape as an 
output tape. When the machine halts, the output is written on the tape, beginning from the 
cell indexed by zero to the first blank symbol on the right. A transducer thus computes a 
function / : (S — A)* — > (S — A)*. If the transducer does not halt on some inputs, the 
function is not defined there. 

Definition 2.3 (Acceptor). An acceptor is a Turing machine that does not write any output 
onto the tape. When it halts, it either accepts or not, depending on whether the final state is 
in the set A. 

Definition 2.4 (Accepting a language). We say that a Turing machine M accepts a 
language L if it always halts and for each input x: 

if x G L then the machine M accepts on input x, 



2.1 Classical Turing machines 
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if x £ L then the machine M does not accept on input x. 

We then write L = L(M). 

We have said that the number of states and the number of possible symbols and so also 
the transition functions are fixed and do not change with the length of the input. However, 
with the increasing lengths of the input, the number of computational steps may increase 
and so the number of cells the machine visits. To capture this, time and space complexity of 
a machine is defined. 

Definition 2.5 (Time complexity). Let T(n) be a junction T : N — > N and let M be a 

Turing machine that on each input of length n proceeds at most T(n) steps before halting, for 
some function T(n). We then say that M has time complexity T(n). 

Definition 2.6 (Space complexity). Let S(n) be a function S : N — > N and let M M be 

a Turing machine which on each input of length n uses (writes to and reads from) at most 
S(n) cells before halting, for some function S(n). We then say that M has space complexity 
S(n). 

Definition 2.7 (Configuration of a Turing machine). A configuration of a Turing 
machine is an ordered set of: 

- the contents of the tape 

- the current state 

- the position of the head 

The set of all configurations of a Turing machine M will be denoted by C{M). 

It is sometimes useful to think of a Turing machine as of a transition matrix T, trans- 
forming configurations into each other. If a configuration c\ leads to another configuration 
c 2 in the next step, there is 1 on the position T C2jCl . 1 Otherwise there is zero. Because 
the tapes are infinite, the matrix is infinite-dimensional. However, if we know that the time 
complexity of a Turing machine is T(n), we may for fixed n have a finite dimensional matrix 
cutting the tapes at the distance T(n) from the initial position on both sides. The size of 
the matrix is then \Q\ ■ (2T(n) + 1) • |E| 2T (")+ 1 which is the number of configurations on a 
tape of length 2T(n) + 1. 

Definition 2.8 (Transition matrix). Let us have a deterministic Turing machine with 
a time complexity T{n). A transition matrix for the length n is a sguare matrix T of size 
IQI • (2T(n) + 1) • |S| 2T (") +1 x IQI • (2T(n) + 1) • |£| 2T (") +1 where 

D ef J 1 if the machine goes from Cj to Ci in one step 
Ct ' Cj 10 otherwise 

It should be pointed out that for most matrices containing only zeros and ones, no cor- 
responding Turing machine exists. The reason is that each Turing machine has a finite 
description of bounded size. A constant size description can only generate constant amount 
of matrices. Using this definition, each deterministic Turing machine can be expressed as a 
family of transition matrices T\,T%, . . . where Tj is a transition matrix for length i. 

2.1.2 Deterministic Turing machines with oracles 

Oracles are abstract devices answering yes-no questions in a unit time. Its natural to ask 
then, if we supply a Turing machine with this kind of information, what more can it compute? 
We will first formalize the situation of a Turing machine with oracle. 

1 Notc that the indices are in reverse order. 
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Definition 2.9 (Oracle). An oracle over an alphabet £ is an arbitrary set of words from 
this alphabet, which we can ask whether any x G S* is there and get the answer in a unit 
time. 

Naturally, one would expect that an oracle Turing machine can do two different things: 
It either undergoes a transition given by its 6 function or asks the oracle and accordingly 
changes its state. Nevertheless, to grasp this, we would have to change the definition of the 
8 function. Because of this, a more common approach is to introduce a special query state 
in which the oracle writes to the tape the answer for the word it finds on the tape and then 
the machine turns to another special post-query state. Obviously, the two approaches are 
equivalent. We review the second approach kind oracle Turing machine for completeness. 

Definition 2.10 (Oracle Turing machine). An oracle Turing machine is an ordered ten- 
tuple (£, A, Q, qi, A, 5, q q , q a , y, n), where the first six items are the same as in the Definition 
XFRand 

q q G Q is a special query state 

qa G Q is a special post-query state 

y G S is a special 'yes ' symbol 

n G £ is a special 'no ' symbol 

It works with any oracle that has the same alphabet. When the machine enters the query 
state q q , it asks the oracle for the word written on the tape beginning from the actual position 
to the first blank symbol on the right. Then the answer is written onto the actual position, 
the 'yes' symbol if the oracle answers "yes" and the 'no' symbol if the oracle answers "no". 
Then the machine enters into the post-query state q a and continues the computation as usual. 

2.1.3 Probabilistic Turing machine 

Definition 2.11 (Probabilistic Turing machine). A probabilistic Turing machine is a 
slightly modified Turing machine. Its transition function does not output only one triple of 
symbol written, new state and direction to move, but outputs a probabilistic distribution of 
such triples. 2 If a probabilistic Turing machine halts, each of possible outputs has a certain 
probability. We say that a probabilistic Turing machine produces a certain probability distri- 
bution. If a probability of halting in a state in F(~)A is p, we say that the probabilistic Turing 
machine accepts with probability p. 

If we look at the finite version of transition matrix here, we see that it does not have 
only Is and Os, but may contain any number from the interval [0, 1]. It is a stochastic matrix 
where each column sums up to one. Instead ofi:ExQ^ExQx {L, R}, we will here 
talk about i:QxSxQxSx {L,R} — > M + . We define 5(q2,o-i,q 2 ,o-2,D) °= a iff the 
probability of going from the state q±, having read the symbol o~x, to the state q 2 , moving to 
the direction D and writing the symbol o~ 2 is a. 

Definition 2.12 (Accepting a language on a probabilistic Turing machine). We say 

that probabilistic Turing machine M accepts language L with completeness c and soundness 
s iff over an input x 

for x € L, M accepts with probability p > c, 

for x L, M accepts with probability p < s. 

Then we write L = L(M, c, s) 

2 Strictly speaking, it chooses among different possibilities how to continue the computation, each having 
certain probability. 
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The following lemma tells us, that we may restrict ourselves to cases where there are 
only two computation paths possible, each with the probability one half. It can be found for 
example in |12j . 

Lemma 2.1. For each probabilistic Turing machine M with time complexity T(n) and each 
e > 0, there exists a probabilistic Turing machine M' with time complexity T'(n) such that 
its possible probabilities are from the set {0, i, 1} and for each language L and numbers c, s 
it holds L(M, c, s) = L(M' , c — e, s + e) and T'(n) = p(T(n)) for some polynomial p(n). 

At each step of a probabilistic Turing machine where there are two possible computation 
paths, we may imagine that the machine behaves deterministically, but uses one random bit. 
At each computation of a time complexity T(n), it may use at most T(n) random bits. This 
leads to the following characterization of languages: 

Observation 2.2. For each language L, a probabilistic Turing machine M with time com- 
plexity T(n) and numbers c,s, such that L — L(M,c, s), there exists a deterministic Turing 
machine Md such that 

For x S L, PyAcngth( y )=T(n) (M d (x , y) accepts) > c. 

For x £ L, Py.i cngi th(y)=T(n) (M d (x , y) accepts) < s. 

where Py.\ en gth(y)=T(n) * s the probability over all y 's whose length is T(n). We may then write 
L = i((M d ,length(T(n))),c,s). 

We put the definition of a nondctcrministic Turing machine here for sake of completeness. 
Having already defined a probabilistic Turing machine, the nondctcrministic Turing machine 
can be thought of being its special case. 

Definition 2.13 (Nondeterministic Turing machine). A nondeterministic Turing ma- 
chine differs from a probabilistic Turing machine only in the condition of acceptation: We 
say that a nondeterministic Turing machine M accepts a language L iff 

For x £ L, M accepts with probability p > 0. 

For x (fc L, M accepts with probability 0. 

Then we write L — L(M). 

2.2 Quantum Turing machine 

A formal definition of a quantum Turing machine was first given in 0. Here we will refor- 
mulate the definition to obtain a more convenient form. When defining a quantum Turing 
machine, we cannot straightforwardly apply the previous definitions of halting onto the quan- 
tum case. The quantum mechanical nature tells us the time evolution of a quantum system 
has to be unitary. In other words, if we express the state of a given system as a vector 
from a Hilbert space, each subsequent state of this system is obtained by applying a unitary 
operator to that vector. The operator is given by the concrete physical situation. 

If we think of configurations as of a base of a Hilbert space, we get the condition that 
the transition matrix has to be unitary, because its in fact an operator that tells us in which 
state the system (here the machine and the tape) will be in the next step. 

Definition 2.14 (Quantum Turing machine informally). A quantum Turing machine 
(QTM) is similar to a probabilistic Turing machine, with the following differences: Here the 
coefficients are not probabilities, but complex numbers called amplitudes. At each step, the 
squares of norms of amplitudes of possibilities sum up to one. For each length of input, the 
corresponding transition matrix has to be unitary. 

We say the machine halts if all branches enter the final state. The output is then written 
on the tape, beginning from the initial position to the first blank symbol. The probability of 
outputting a configuration is a squared norm of the corresponding amplitude. 



10 



Quantum Turing machine 



It should be emphasized that the transition matrix now contains complex numbers. Now, 
we may think of configurations as of orthonormal base vectors of some Hilbert space H. 
Then, at each step the S function output corresponds to some vector v from H, v — ^2 &iCi, 
where Cj are configurations (so base vectors), and a; are corresponding amplitudes. The 
definition of quantum Turing machine tells us that the sum of norms of a,, a* at has to be 
1. We will thus work only with unit norm vectors. We will also take a profit of the Dirac 
notation and write 

fc 

to indicate that from configuration c, there are k possible resulting configurations Cj, each 
with respective amplitude ai. Now, we can define quantum Turing machine more formally 
as: 

Definition 2.15 (Quantum Turing machine formally). A quantum Turing machine is 
an ordered six-tuple (S,A,Q,qi,qf,8), where 

£ is a finite set, called "alphabet" , of all possible tape symbols. We assume it equals 
{0,1, A}. 

A G X is called a "blank symbol" 
Q is a finite set, called "set of states" 
qi is the initial state 
qj is the final state 

S : £ x Q — > H is a transition function and H is a Hilbert space spanned by base vectors 
corresponding to triples from £ x Q x {L, R}, 

and the corresponding finite transition matrix is unitary for all lengths of input. 

The acceptation of a language on a quantum Turing machine has to be redefined, due to 
the change in the interpretation of probabilities: 

Definition 2.16 (Acceptation of a language). We say that quantum Turing machine M 
accepts language L iff there is a configuration a{x) such that for each x the machine M halts 
in a configuration a,i\ci) such that 

IfxeL then £\ a*] (a(x) |cj) | 2 > § 

Ifx$L then £1*1(^)1^) I 2 < i 

The function a(x) in the definition above tells us, which configuration will indicate x G L 
for each x. Then we want that the probability of outputting such configuration is bigger 
than two thirds for x G L and smaller or equal to one third for x ^ L. 

As for deterministic and probabilistic Turing machines, it was shown that we suffice with 
rational entries in the transition matrix, here thus amplitudes. 

Lemma 2.3 (Constant number of amplitudes). For each quantum Turing machine 
M with time complexity T(n) there exists a quantum Turing machine M' that has only the 
numbers 0,±|,±|,1 as amplitudes, accepts the same language and its time complexity is 
p(T(n)) for some polynomial p. 



Proof. In Q]. 



□ 



2.2 Quantum Turing machine 
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2.2.1 Oracle quantum Turing machines 

As in the classical case, we will want to provide quantum Turing machines with oracles. 
However, to incorporate an oracle into the notion of a quantum Turing machine, we need 
that the query process does not violate the unitarity condition (see Def. I2.15fl of a quantum 
Turing machine. That means that the the computation has to be reversible - no information 
can disappear from the system. This for example implies that the answer cannot be written 
over a non-blank symbol that was not previously copied to somewhere else etc. We will thus 
assume that oracle changes the current bit if the answer is "y es " an d leave in unchanged if 
the answer is "no". In other words, we will assume that if the contents of the tape to the 
first blank symbol on the right is \x, 6), then in the post-query state it is \x, b © I[x)) where 
I(x) = 1 if x G O and I(x) — otherwise. For sake of completeness, we also write here the 
formal oracle quantum Turing machine definition. 

Definition 2.17 (Oracle quantum Turing machine formally). A Turing machine is 
an ordered eight-tuple (S, A, Q, tfo, qf, S, q q , q a ), where 

£ = {0, 1, A} is a finite set, called "alphabet". 

A G S is called a "blank symbol" 

Q is a finite set, called "set of states" 

qi is the initial state 

qf is the final state 

5:ExQ^(ExQx {L, R})^'^' 2 is a transition function 

q q G Q is a special query state 

qa S Q is a special post-query state 

and the corresponding finite transition matrix is unitary for all lengths of input. When the 
machine enters the state q q , let the contents of the tape from the current position to the first 
blank symbol be \x, b) , where b is a single qubit. Then the oracle transforms the state to 
\x,b® I{x)), where I(x) is the indicator of x and the machine enters the post-query state q a . 

2.2.2 The quantumness of a quantum Turing machine 

The natural question is now: "Where is the quantumness of a quantum Turing machine 
hidden, if there ever is any?" In other words, what property makes the quantum Turing 
machines possibly more powerful? 

As for example suggest, it is the following feature of quantum Turing machines: Two 
different paths may interfere both constructively (if their amplitudes are equal) , or destruc- 
tively, if their amplitudes are opposite. Here we justify it by the following by the following 
observation: 

Observation 2.4. // all amplitudes in a quantum Turing machine M are real positive, then 
there exists a Turing machine accepting the same language. 

Proof. From the assumption about amplitudes, it follows that in the transition matrix of M , 
there are only real positive numbers. That means that as the entries there are only Is and 
0s. Let us assume the opposite and have a row r with two or more nonzero entries, which 
can be, without loss of generality, symbolically written as 



k 




(2.1) 
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where k > 2 and e» are base vectors 3 and a, are real positive coefBcients. Then there must be 
at least one more row r' that has a nonzero coefficient, let us say b\, on the first place, because 
otherwise the first column could not have a unit norm. This implies that rr' > ai&i > which 
violates the unitarity condition so we have arrived at a contradiction. Thus, the transition 
matrix is a permutation matrix and the machine M now works as a classical Turing machine, 
since all squares of amplitudes are equal to 1. □ 



of the same base where the matrix is written 



Chapter 3 

Quantum circuits 



In this chapter we will define quantum circuits and show their strengths or weaknesses com- 
paring to classical Turing machines. We will not talk about quantum Turing machines, this is 
left to chapter^] We will however not introduce the concept of classical circuits, which would 
be at the first sight a more natural model to compare the quantum circuits with. The reason 
is that the quantum circuits have some important intristic properties different form those of 
classical circuits and so comparing these two is unnecessarily complicated. Furthermore, we 
will only introduce uniform complexity classes based on circuits, as is the class BQP. 

Definition 3.1 (Quantum circuit). Let us have a two-dimensional Hilbert space H . Quan- 
tum circuit of width n in that space is a unitary operator U of size 2™ x 2™ working on joint 
systems composed of n qubits from H, which are initially in some base state of H n . U thus 
realizes some function f : (base of H n ) — > H n . If we identify the base vectors of H , |0) and 
with logical and 1, and the output with some probability distribution by squaring the 
norms of amplitudes, we may say that for each bit string of length n, the circuit outputs some 
probability distribution over all bit strings. We may also treat some subsystems as auxiliary, 
in that case we forget about them when evaluating the output. 

A circuit of width n is visualized in Figure l3~T1 The horizontal lines are called wires. The 
joint system we work with and also the respective subsystems are called a quantum register. 

The unit M, a so called measurement operator is there to emphasize we look at the 
output as at a probability distribution and forget the complex values of amplitudes. This is 
in accordance with quantum mechanics which tells us that after being observed, a quantum 
system collapses to one of the basis vectors with a probability corresponding to its amplitude. 1 

The computational power of classical computers is expressed by Church- Turing thesis: 
"Any computational device can be simulated by a Turing machine." It also holds that 

Here we silently assume the measurement is done in the same basis as we work in. 



|(A 4 > 
l<A 5 > 



Figure 3.1: A visualization of a quantum computer. U is a unitary matrix and M is a measurement 
operator. U is applied onto \ipi) ® \ipi) <g> \ip2) ® . . . ® \ip§) and to the result a measuring operator M 
is applied. 



u 



M 
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Lemma 3.1 (Power of quantum circuits). For any Turing machine T that halts for all 
inputs and any bit string I , there exists a unitary matrix U , whose action result on the I 
encoded into a quantum register leads to the same output as of T on I. 

Proof. Each Turing machineM realizes some computable function x —* M(x). Each such 
input and output can be encoded into bits and subsequently into qubits. A matrix U realizing 
transformation (x,0) — > (x,M(x)) is certainly unitary, since we have that 

(x,0)±(y,0) x±y <=► (x, M(x)) ± (y, M(y)) 

This means that we do not start the algorithm only with a register containing x, but add 
another register containing only zeros, which are usually called "padding" zeros. If we did 
not use them, the transform realizing x — > M(x) would not generally be unitary. □ 

We see that computational power of a quantum circuit is superior or equal to the one of 
a classical Turing machine. 



3.1 Universal sets of gates 

When working with quantum circuits, we arrive at a need to define a complexity measure. 
With n wires, the matrix U in Figurc mi has a size of 2™ x 2". Thus, the size of the matrix can 
not be a good measure of complexity, since all algorithms would have exponential complexity. 
Furthermore, if we have U = 1 we are practically doing nothing and the complexity should be 
0, while the size of that matrix remains exponential. Thus, the complexity measure should 
somehow reflect how complicated the structure of the matrix is. Also, it should hold that 
matrices of a low complexity can be practically realized easier than those with a higher 
complexity. Furthermore, if we work with fewer qubits, the corresponding complexity should 
lower. Thus, 

Requirement 3.1.1. IfU = U-JJi then its complexity measure C should be C(U) < C(Ui) + 
C{U 2 ) 

If U = C{U\ £§> U2) then its complexity measure should be C(U) < C(U±) + CiJJz). 

The requirement is illustrated in figure I5~2l Now, if we choose some set of gates, called 





Figure 3.2: A composite unitary matrix. The left picture shows U = Uit&Uz and the right U = U\Ui- 
It should be pointed out here again, that the size of a gate working on n qubits is always 2™ since 
it transforms an n-qubit string into another, of which there are 2" possible mutually orthogonal 
combinations. And since its unitary, such a matrix is always square-shaped. Thus, the width of the 
gates in the figure is irrelevant. On the left, we have two gates of sizes 2 2 and 2 4 respectively, while 
on the right, we have two gates of size 2 6 . 



elementary set, and set the complexity of gates from that set to 1, we can construct upper 
bounds of the complexity using the Requirement 13.1.11 To proceed, we introduce some 
definitions. 



3.2 Some important gates 
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Definition 3.2 (Universal set). A set of quantum gates is said to be universal for quantum 
computation if any unitary operator can be expressed as a circuit involving only gates from 
that set. 

Many times, when we are dealing with probabilistic quantum algorithms, we may allow 
for small differences in the gates. We may suffice with a gate that is almost equal to the gate 
we want. 

Definition 3.3 (Approximately universal set). A set of quantum gates is said to be ap- 
proximately universal for quantum computation if any unitary operator can be approximated 
with arbitrary precision by a circuit involving only those gates. 2 

To proceed further, we will define some important gates. 



3.2 Some important gates 

It was shown that any unitary gate can be built up from one and two-qubit gates. This fact 
is important for an eventual quantum computer, since such gates are easier to prepare and 
manipulate. Here we will define some of them, which wc will need later. 



One qubit gates 

The Hadamard gate is defined as 



"-Ml -I) 

It acts on a ket vector a\0) + b\l) as 

H { a\0) + b \l)) = -L({ _;)(«U) +fe (!)) = 

= -L((a + &)( J ) +(a-b){ J )) =-^((a + 6)|0) + (a-6)|l)) 

The Hadamard gate is of a great importance, since a tensor product of n Hadamard gates 
can be used to prepare n-qubit quantum register in a linear superposition of all 2™ base states 

li): 

(n-1 \ n-1 . n-1 2" 

(g) H |0") = (g) H\Q) = -= (g(|0) + |1)) = ]T |j> 

i=0 ) i=0 V i=0 V j=0 

Notice that we have needed only n Hadamard gates to prepare a superposition of exponen- 
tially many -2™ - elements. This feature is of a great use in quantum algorithms discussed 
later on. 



A phase shift gate is for any ffel defined as 
and acts acts 

P(fl)(«|0)+6|1))= (J e » )(«( J )+»(?)) = 

-(■(;)+*•(:))■ 

2 Sincc wc say "arbitrary precision" here, we do not bother about the metric we use to define the distance 
between the matrices and thus the precision. It can be just any metric, if the word arbitrary means "any 
nonzero value in the metrics range" . 
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Two-qubit gates 

CNOT gate 



CNOT = 



It acts on a two-qubit state a|i 
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An important property of CNOT gate is that it is inseparable, e.g. it cannot be written as 
a product of two one-qubit gates. 

Now, we are ready to introduce some universal sets. In 0, elementary sets are presented. 
We will suffice with the following group: 

Lemma 3.2. The following sets of gates are universal: 

- For a constant k, set of all gates of size k x k. 

- All one-qubit gates and CNOT 

- {P{9) 1 CNOT,H} 

One of the most widely used universal sets is CNOT together with all one qubit gates. 
We often treat this set as elementary and use it to measure complexity. In this case, it has no 
sense to distinguish subpolynomial complexities. Nevertheless, many different small subsets 
of this elementary set, like {CNOT, H, Phase(^)} are approximately universal. Here, when 
having only three elements, we can measure complexity precisely. The universal set is on the 
other side more comfortable to use when working with complexity classes up to polynomial 
reductions. 

It turns out that the complexity of a general unitary transform is most likely exponential. 
In other words, a general unitary gate working on n quantum registers can not be composed 
from less then fi(4™ -2 ) two-qubit gates. The argument is simple: 

Lemma 3.3. For each n there exists a gate of size 2" x 2™ working on n qubits, such that 
complexity of that gate is 0(4™~ 2 ). 

Proof. A unitary gate of size 2™ x 2" has 2 2 ™ = 4" free angle parameters. A two qubit 
gate has 2 2 ' 2 — 4 2 parameters. A number of free parameters does not change when doing a 
tensor product with 1. To collect 4™ free parameters, we need at least 4a = 4™~ 2 two qubit 
gates. □ 

The lemma suggests that most of quantum algorithms will have too high a complexity to 
be of any use. Due to another result in 0, any two qubit gate can be expressed by four one 
qubit gates and two CNOT gates, we don't have to look at their concrete form of one-qubit 
and two-qubit gates we use, but only at their dimension. The following lemma that appeared 
at |3] shows, that the lower bound is close to the upper bound: 



Lemma 3.4. Each unitary matrix of size 2 r 
qubit and CNOT gates. 



x 2 n 



be decomposed to at most 2°^ 



3.3 Entanglement 
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Using the notation from the theory of classical circuits, we rather refer to a size of a 
circuit instead of complexity: 

Definition 3.4 (Size of a circuit). A size of a circuit C with respect to an universal (resp. 
approximately universal) set of gates is the complexity with respect to that set. 

If we are talking about sizes up to a polynomial, we may omit to name a concrete set of 
gates, thanks to lemma EOl provided we have in mind that we should refer only to universal 
sets of gates of bounded size. 

Now, we will define how can quantum circuits accept languages. The function a(x) has 
the same role as in the definition 12.161 It tells us which configuration is the accepting one 
for each x. Each circuit obviously works only on inputs with some predefined size. 

Definition 3.5 (Acceptation of a language). We say that a quantum circuit C of width 
n working on Hilbert space H accepts language L iff there is a function a(x) : (base of H) — > 
base of H such that for each x of length n it outputs a state ai\ei) such that 

IfxeL then £\ a t \ (a(x) \e t ) | 2 > § 

Ifx^L then Y.MH^ei)? < \ 
where {e^} is a base of H. 



3.3 Entanglement 

The phenomenon of entanglement has been extensively treated by many scientists (see for 
example [5]) and also mystified in many articles. It arises as follows: A quantum system 
composed of n qubits lives in a Hilbert space of dimension 2 n , generated by for example the 
following base 

2 „-i 

U 

On the other side, a linear space generated by vectors of length n with complex coefficients 
has dimension n and can be injectively mapped onto a proper subset of that Hilbert space. 
The mapping works as 

n 

(ci, 02, . . . , c n ) «-> (g) - |ci| 2 |0) + Ci\l)) (3.2) 

i=l 

and is a natural mapping of complex vectors onto qubit states. The state of the i— th qubit 
straightforwardly corresponds to the i— th component of the complex vector. Such quantum 
states are called disentangled or separable states. The complement of that subset to the entire 
Hilbert space is filled by entangled states. This is where the mysterious spirit of entangled 
states comes from. They don't have their counterparts in a complex space of vectors of length 
n and are thus maybe a bit counterintuitive. It can be seen that separable states generate 
the whole Hilbert space. It follows that any vector from the Hilbert space can be expressed 
as a sum of separable states. Before describing its magical features, we will state a simple 
observation. 

Observation 3.5 (The role of entanglement). Without entanglement, quantum computer 
is polynomially equivalent to Turing machine. 

Proof. In a quantum computer with n-qubit register without entanglement, at every stage 
of computation, the register is in a separable state. Thus, at every stage, the register's state 
can be described by a complex vector of length n. For each two vectors, there exists a general 
matrix realizing a transformation from one to another. Thus, the whole computation can 
be seen as a simple matrix multiplication. Each Turing machine can be written as a matrix 
multiplication and each multiplication can be realized on a Turing machine, with only a 
polynomial slowdown. □ 
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By further examination of properties of separable states, we may state a stronger version 

Observation 3.6. Without entanglement, quantum computer can be simulated on a Turing 
machine in linear time. 

Proof. Because the gates in the quantum computer (again with n qubits) can not lead to 
inseparable states, the gates have to be separable itself. That means the circuit is composed 
of only one qubit gates. They can be at most n. If they are more then 1 on a single wire, 
they can be multiplied up. Thus, the complexity of simulating the circuit is n times the 
complexity of multiplying a matrix of size 2 x 2 by a vector, which is constant. □ 

However, one should note that a quantum circuit with n gates has complexity at least n. 
Therefore, the complexities of both the machines are asymptotically equal. 

We have seen that for a quantum circuit to ever perform asymptotically better than 
classical computers, entanglement is crucial. If it ever happens that there is no entanglement 
in a quantum algorithm and still it performs more than linearly better than its classical 
counterpart, it means that the latter can be improved. 

3.4 Quantum query model 

As we let quantum Turing machines access additional information via oracles, we will also 
allow the circuits to access some kind of additional information. Here, in the model of 
quantum circuits, the oracle is picturized as a special type of a gate, called "black box" to 
emphasize we can not see its internal structure. It acts on registers of the form \x,b) as 

\x,b)0\x,y®I(x)), 

where b is a single qubit and I(x) is the indicator of x. This unit is thus effectively equivalent 
to the oracle unit used in oracle quantum Turing machines. As before, we need to assume 
one more thing comparing to classical black boxes: The quantum black-box needs to support 
superposition, e.g. to behave as a linear operator. The black-box model has turned out to 
be very fruitful for various results, see for example 6 . 
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In this chapter, we will show the relation between QTMs and quantum circuits. We will 
review the definitions of conventional complexity classes and define the class BQP. 

4.1 Relation between QTMs and Quantum circuits 

To ever talk about the relation between quantum Turing machine and quantum circuits, we 
have to define what simulation of respective things mean. 

Definition 4.1 (Simulation). We say that a quantum Turing machine Q simulates a quan- 
tum circuit C on an input I , if Q, fed with I , gives as output probability distribution identical 
to the one that C gives. 

We say that a circuit C with input I simulates a quantum Turing machine Q on that input, 
if they both on that input output the same probability distribution. 

To show the relation between quantum circuits and quantum Turing machines, we intro- 
duce the following observation: 

Observation 4.1. For each constant integer k and unitary matrix U of size 2 k x 2 k working 
on k qubits, there exists a quantum Turing machine that simulates circuit containing only U . 

Proof. That matrix can be thought as a function on an appropriate Hilbert space. For each 
constant k, such function can be realized on some quantum Turing machine. □ 

It is necessary that A: be a constant for the observation to hold, since for the simulation, 
the quantum Turing machine must have |E| + \Q\ of size at least log(2 k ) = k. 

Lemma 4.2. For each circuit of size n there exists a quantum Turing machinewith time 
complexity T{n) simulating that circuit such that T(n) = 0(n). 

Proof. Each gate of size 2 k can be simulated on a quantum Turing machineusing at most 
20(k) s ^ e p S> Thus, if the maximal size of gates in the universal set is m, the simulation takes 
time at most 2°( m >n = 0(n) steps. □ 

The converse relation between quantum circuit and a quantum Turing machine was proved 
at |16| . Here we present a slightly simplified version: 

Lemma 4.3. For each integer n and each quantum Turing machine Q of time complexity T, 
there exists a circuit of poly(n,T) elementary gates that simulates Q on any input of length 
n . 

Proof. For each of T steps of Q, a separate circuit is constructed. The machine cannot go 
further than to distance of T from the beginning position of the tape, so we cut off everything 
except the middle 2T + 1 cells. For each cell, we add I = (1 + fZog( |Q| + 1)1 + [Zo£f|E|] wires 
to the resulting circuit. These wires encode: 
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- the current state: |7o<7(|Q| + 1)] wires, where the 1 stays for a new state s (if the 
machine never reaches there) 

- the symbol on that cell: |7o<?|£|] wires 

- a bit indicating whether the head is on respective cell l=yes 

Then for each step and cell, we take the cell the head is just on and the two neighbors, together 
tree cells. This suffices, since in one step, the head cannot escape from them. Furthermore, 
we know that without loss of generality, we may not let the machine stay at the same state 
after a transition. Now we want to have a unitary matrix U that would transform the state 
on the 31 wires according to how the transition function i5 would do. Formally, we want that 

U\(n, ai,0)(q, a, l)(n, a r , 0)) = ^S(q, a, q' , a', L)\(q', a h l)(n, a', Q)(n, a r , 0)) + 

a',q< (4.1) 

S(q, a, q', a', R)\(n, a h 0)(n, a', 0)(a', a r , 1)) 

Luckily, it holds that for all distinct pairs of g, a, a r ,ai, the vectors 
U\(n, oj, 0)(<7, a, l)(n, a r , 0)) are mutually orthogonal: 

((s, an,0)(q 1 ,a 1 , l)(s, a rl , 0)\U + U\(s, a t2 , 0)(q 2 , a 2 , l)(s, a r2 , 0)) = 

( X! %i> a i>9i> a i> a *i> l)(s,ai,0)(s,a r i,0)| + 

%l, ai, ai, i?)((s, oji, 0)(s, a' l5 Q)(^, a rl , 
( X! (5 ( 2,a2,g 2 : a 2) i )l(9 2J a i2, l)(s,a 2 ,0)(s,a r 2,0)) + 

a' 2 ,q' 2 

<5(<72, a 2) g 2) 4: #)l( s ! a *2, 0)(s, a 2 , 0)(g 2 , a r2 , 1))) = 
^2 S (qi, a 1} q[, a[, L)S(q 2 , a 2 , q' 2 , a' 2l L) 

((?!, an, l)(s, a' 1; 0)(s, a r i, 0)|(g 2 , a i2 , l)(s, a' 2 , 0)(s, a r2 , 0)) + 

<5(gi, ai, gi, ai, R)S(q 2 , a 2 , q' 2 , a' 2 , R) 
((s,ai 1 ,0)(s,a' 1 ,0)(q' 1 ,a rl , l)\(s, a i2 ,0)(s, a' 2 ,0)(q' 2 , a r2 , 1)) = 

5 ail a l2 5 arl ,a r2 (^2 S {qi , «i , 9i , a[ , L)5(q 2 , 02, q[, a[, L) + 5 (q x ,a 1 ,q[, a[ , R)S (q 2 , a 2 , q[ , a[ , # )J 

= (4.2) 

Where the last equality follows from the unitarity condition of the quantum Turing machine, 
because in the transition matrix all pairs of rows are mutually orthogonal. The rest of base 
vectors can be thus set so that U is a unitary matrix. We add such matrices to each 3 
consecutive cells. The order does not matter thanks to the definition of U. This simulates 
one step of Q. To simulate T steps, we just chain T such circuits. 

We have together used 0(T(2T+ 1)) matrices of size 2 3t x 2 31 and 20(l(2T + 1)) wires. From 
Lemma 13.41 we know that such matrices can be realized by 2°'" elementary gates. In sum, 
we used (T(2T + 1))2°W = T 2 2°^ elementary gates. □ 

4.2 Complexity classes 
4.2.1 Classical classes 

For sake of completeness, we briefly review classical complexity classes definitions. 



4.2 Complexity classes 
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Definition 4.2 (Classical polynomial time complexity classes). A language L is in 
class C if there exists a polynomial p(n) and a Turing machine M with time complexity p(n) 
such that 



complexity class C 


P 


NP 


PP 


BPP 


For i£i, Py:\y\ =p (n) (M(x,y) accepts) 
For x L, Py.\y\=p(n) (M (x , y) accepts) 


= 1 
= 


> 
= 


<t 


> M 

i 



where exactly one of the columns applies. 



The abbreviations stand for polynomial, nondeterministic polynomial, probabilistic poly- 
nomial, and bounded probability polynomial respectively. 

All of these classes arc £2 definable, which means that is one 3 and one V quantifier 
before a general predicate computable in the class P. That is considered to be a condition 
of a robust definition. 

In the future, we will need different numbers in the definition of the class BPP then | and 
|. Therefore, we will introduce the following two lemmas. The first is a wellknown bound 
on the tail of the distribution of a sum of random variables and can be found for example in 
|15j . The second, amplification of the accepting probability, can be found in many complexity 
books, for example in |12j . 



Lemma 4.4 (Chernoff bounds). Suppose x\. 



fixed distribution, 
following holds: 



Let the mean value of x be fj, 



X2, ■ ■ ■ x n are chosen independently from a 
E(x). Then for any positive A the 



N 



> A < e" 



Lemma 4.5 (BPP amplification). The numbers ^ and | in the definition of the class 
BPP may be equivalently changed to any a and b such that a > b + e for some e > 0. 

Proof. Let us have an algorithm that accepts x € L with probability a and x ^ L with 
probability b. We run the algorithm k(n) times for k(n) being an even polynomial and then 
accept if and only iff at least of the runs accepted. Now we define new quantities Zj in 
the following way: 

{1 if i-th run accepted 
otherwise 

For, x € L, we have that E(z) > 1 • + ■ (1 — a) — a and 

for x L, we have that E{z) < 1 • b + • (1 — b) = b. Now we are ready to use Lemma f4.4l 
We have 



/ ft a + b\ 

P(x £ L is accepted) = P I > k — - — J 

\<=i / 



< 



a — b 



l-P 



V fc x- 



a— b\ , ( a - b V 
> > 1 - er\— ) 



and 



P(x L is accepted) 




<P 



Now it suffices to define k(n) big enough so that e 



-b \2 k 



< 



> 



and 1 



/ a,-b \2 fc 

< e V 2 ; 2 



-b\2 k 



> 



3 and 3, we can reach any other pair a 



Conversely, if we start with the original numbers | 

and b satisfying the assumption a > b + e for some e > by setting the polynomial k(n) big 
enough so that e - ^) 2 < 5 an( j 1 _ e _ (s) I > a □ 
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Putting it together 



PSPACE 



PP 




P 



Figure 4.1: Hierarchy of classes. For each pair connected by a line, the class that stays upper contains 
the lower one. 



Definition 4.3 (PSPACE). A language L is in PSPACE if there exists a polynomial p(n) 
and a Turing machine M with space complexity p(n) such that L is accepted on M . 

4.2.2 BQP class 

Definition 4.4 (BQP). A language L is in BQP if there exists a polynomial p(n) such that 
L is accepted by some guantum Turing machine with time complexity p(n). 

Thanks to Lemma f4. 31 and Observation ^. II we may equivalently define BQP as 

Definition 4.5 (BQP). A language L is in BQP if there exists a function f(n) and poly- 
nomials p(n),q(n) such that for each n, output of f(n) is a circuit C of width n and size 
p(n) such that the language L n = {x G L : \x\ = n} is accepted by C and the running time of 
f(n) is at most q(n). 

The position of the class BQP in the hierarchy of other classes is shown in figure ETT1 



Chapter 5 

New quantum classes 



In this chapter we will define new complexity classes. The goal will be to provide an easy to 
handle hierarchy of simple classes in which the class BQP is contained, and take profit of it 
to show some theorems. We will begin with the definition of BQP, which we reformulated 
in a matrix fashion. 



5.1 The class BQP revisited 

Definition 5.1 {BQP). A language L is in BQP if there is a function 
f : N— > (set of all quantum circuits) computable in polynomial time with respect to its 
argument, such that for each I, its output is a polynomial size quantum circuit T and two 
functions ci{x) and ca{x) computable in polynomial time such that for each x of length I: 



For x € L, 



have 



(c A (x)\T\cj(x)) 



> 



For x (£ L, we have (ca(x)\T\ci (x)} 



< 



In this and all the definitions that follow, we do not require that the functions ci , ca 
have the same range and domain. We for example allow for \cj(x)) = \x,x). The function 
ci(x) corresponds to some transformation of the input, for example adding 'padding' zeros or 
whatever. It can be arbitrary function as long as it is computable in polynomial time. The 
function ca{%) expresses the fact that the accepting state of the register typically depends 
on the input. The only thing we require is that given x, we can compute in a polynomial 
time for which state we are looking for. The equivalence of Definition 15.11 and Definition ^. 51 
is due to Definition 13.51 

In the future, we will frequently need a higher number than the | in the definition. For 
this purpose, we have to prove the following three lemmas. Although its is done compactly 
in 5 , we split it into three lemmas here, because later we will need those parts separately. 

Lemma 5.1. Let us have a language L, a polynomial size circuit T and a function ci(x) 
computable in polynomial time. Let us have a function a(x, y) which decides, for a given 
input x, if y is an accepting configuration or not. If so, it outputs \, otherwise it outputs 0. 
Let us also assume the following: 



For x € L, we have ^2 



y:a(x,y) = l 



(v\T\ci(x)) 



> 



For x L, we have ^2 y . a 



(x,y) = l 



(y\T\ci(x)) 



< 



We can then equivalently amplify the numbers | and | into l — e q ^ and e q ^ for any positive 
polynomial q(n). 
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Proof. This can be done by replicating the circuit k(n) for some polynomial k. As in Lemma 
14.51 we will do a majority vote. The new function a'(x, 2/12/2 • ■ ■ 2/fc(n)) will check that for at 
least half of the ys there is a(x,y) = 1. We have 



T' 



i=l 



and 



a' (x, 2/12/2 • ■■Vk(n)) 



1 if at least a half of the y equals ca {x) 
otherwise 

We define Zi as in Lemma 14.51 and proceed analogously. Then we have 
for x G L: 

kin 

= 1 



E 



<y|T*0|cz(*)> =P >1 



V 6 J 2 



for x ^ L: 



E 

y:ct'(3!,3/lK2— Vfc(„))=l 



(2/|T*W| C/ (x)) =P K>< 



fe(n) 



i=i 



It now suffices to set fc(n) = 2 ■ 3Qq(n). 



□ 



The following lemma shows that for the class BQP, we may equivalently require that 
there be more accepting configurations: 

Lemma 5.2. Let us have a language L, a polynomial size circuit T and a function c/(x) 
computable in polynomial time. Let us have a function a(x, y) which decides, for a given 
input x, if y is an accepting configuration or not. If so, it outputs 1, otherwise it outputs 0. 
Let us also assume the following: 



For xeL, we have T,y. a ( x , y )=i 
For x£L,we have J2y.a(x, y )=i 



(y\T\ci(x)} 



> 



< 



(y\T\c z (x)) 

We can then build a quantum circuit accepting the same language showing that L £ BQP. 
Proof. Using Lemma 15. II we know that we may assume the following: 



For x € L, we have J2y. a (x,y)=i 

\(v\T\ci(x)) 



> 



For x ^ L, we have J2y. a (x,y)=i = (y\T\ci(x)) 



< 



Now, we replicate the input and add another wire in state |0). To the end of the resulting 
circuit, we add another polynomial size circuit that would computes 



f(y,x,o) 



\y,x,l) ifa(x,j/) = l 
\y,x,0) ifa(x,2/) = 



Such circuit certainly exists since a is computable in polytime and / is injective. Then, we 
add T^ 1 working on the first register, to transform (x, y, i) — > (x, x, i). We then set 

c'j(x) D = F |x, x, 0) 
c' A {x) D = F |x,x, 1) 



5.1 The class BQP revisited 
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The resulting polynomial circuit will be denoted by T". 



Now, we look at 



We have 



(c! A (x)\T'\c\{x)) = (z,z,l|T-\/T|x,x,0) = 

{x,x,l\T- l f \y,x,0){y,x,0\T\x,x,0)+ 

y.a(x,y) = l 

foa.llT- 1 / E \y,x,0)(y,x,0\T\x,x,0) 

y:a(x,y)=0 

= E (x,xA\T^\y,x,l){y,x,0\T\x,x,0)+ 

y:a(x,y)=l 

E (x,x, l\T^\y,x,0)(y,x,0\T\x,x,0) 

y:a(x,y)=0 

= E (y> x i l \ T *\ x , x , 1 )(y,x,o\T\x,x,o)+ 

y:a(x,y) = l 

E (v, x ,Q\ T *\ x i x i i)(y, x ,o\T\x,x,o) 



y:a(x,y)=Q 



= E l(2/l T l x ) 

y:a(a;,j/) = l 



Then for i e L, we have that 



and ioi x ^ L 



(4(:r)|T'|c^)> 



&(x)|TK(s)> 



E l^m 

y:a(a;,iy) = l 



E l^m*) 

y:a(a;,y) = l 



> o.r > 



< o.r < 



□ 



Lemma 5.3 (Probability amplification). The numbers |,| m Definition I5.il caw 6e 
amplified to 1 — e~' ? ("- ) 7 e~ 9( ") for an arbitrary positive polynomial q(n). 

Proof. We combine Lemma 15 . 1 1 with the proof method of lemma [5^1 First, we may define a 
function 

def J 1 if y = c A (x) 
I otherwise 



a(x,y) 



then we can amplify the accepting probabilities as in Lemma |5. II We replicate the circuit 
enough times to get probabilities | and |, so we have 



For x £ L, we have E y:o (»,i/)=i (y\ T \ c i( x )) 



> 



For x i L, we have E„ !0 ( ;B ,y)=i = (yl T l c -f 0*0) 



< 



Then we add an extra wire in state |0). To the end of the resulting circuit, we add a unit / 
that will set the state |1) in the extra circuit iff more than half of the circuits output \ca(x)), 
and otherwise leave it untouched. Then we will rollback the algorithm adding an inverse 
circuit of the original one to each subregister. The rollback is necessary because we want to 
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Figure 5.1: The circuit used to amplify the probabilities in Lemma \5.3\ Here we replicate the original 
circuit 5 times. 



have only one accepting configuration for each input x. The situation is illustrated in figure 
15.11 Formally, we define 



c' 7 (x) D ^ F |c 7 (x) fc ,0) 
c' A {x)^ \c { x)\l) 



rpt DEF 



IT 81 / 



IT" 1 01 



and get 



(4(z)|T'|c^)> - (ci(x) k A\ (^(g) Tj <g)lj /M (gjT-M ® lj \cj(x) k ,0) = 
E ( ((g) t) ®l) /|y,0)(y,0| ( ((gjT- 1 L |c/(x) fc ,0) 

y.f(y) = l \\ k ) / VVfe / / 

E ww". 1 ! ((® T ) ® : ) i^ 1 ^ ' ((® T_1 ) 01 ) i c/ ^ fc '°> 
E (<g) T ) I2/X2/] ((g)^ 1 ) m*)*) 

\ k J \ k / 

E (^l (W) l2/)( c Kx) fc | ((g) T*) 

3/:/(3/)=l \ k / \ k / 



E 

2/:/(2/) = l 



<4(x) fe | (g)T 



Now we define new quantities Zi the following way: 



1 if yi = CA{x) 
otherwise 



5.2 The new families 
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For x e L, we have that 2?(z) >l-|+0-| = |. Thus 



(c' A {x)\T'- 1 fT'\c' I {x))= |(2/.0|T'| Cj ( a: ) fe ,0)| = P ( J> > 

y.f( v )=l \i=l 



>P 



fc 3 



s s =1 - p l 



fc 3 



> - > 1 - e^^s- 1 2 
" 6 " 



Similarly, for x L, we have that <l-|+0-| = 4. Thus 



y:f(v)=l V »=1 / 



<P 



To finish the proof, it now suffices to set fc(n) = 36q(n). 



fc 3 



> _ < e -(e) 2 

" 6 " 



□ 



5.2 The new families 

When we were treating the model of a quantum Turing machine in Chapter|21 we encountered 
transition matrices of the machines. They have a size exponential in the length of the input, 
but are unitary and each of their entries is just a simple function, computable in polynomial 
time. In Chapter there was another family of unitary matrices, those that are expressible 
as a polynomial size circuits. Here, we will formalize these two families of unitary matrices 
and then use the formalism to define new complexity classes. 

Definition 5.2 (Unitary serie). An infinite series U\, U2, ■ ■ ■ of unitary matrices is called 
here a unitary series. 

Now, we will formally define two properties of unitary series. The first one, a property of 
being a polynomial size circuit, will be called PI. 

Definition 5.3 (PI). A unitary series Ui, U%, . . . has property PI iff there exist polynomials 
p(i), q(i) and a function f(i) computable in time q(i) such that for all i: f(i) = Ui and Ui is 
expressible as a circuit of size p(i) of at most two-qubit gates. 

This property is very hard to manipulate with, since there are no general theorems avail- 
able that would, seeing a huge matrix, tell us an effective way to find out whether it can be 
decomposed to some number of smaller gates or not. The only way is to try to decompose 
the matrix, which indeed takes time polynomial in dimension of that gate, which usually has 
exponential size itself. 

When we defined a quantum Turing machine, we dealt with series of transition matrices, 
which were also unitary, but had another property, here called P2: 

Definition 5.4 (P2). A unitary series Ui, U%, . . . has property P2 iff there exists a polyno- 
mial p(i) and function f(i, x, y) such that for all i: 

Vx,y : (y|f7i|a:) = f(i,x,y) 

and the running time of f is at mostp(i). 

We know from Section l2.1.1l that for a unitary series, having property P2 is not sufficient 
to be a series of transition matrices of some quantum Turing machines. On the other side, 
the property P2 is very easy to handle and if we just require this and not the existence 
of a quantum Turing machine which would correspond to that matrix, we arrive at another 
complexity class which we know is stronger than or equal to BQP and thus we can eventually 
use this class to prove negative theorems about BQP. 
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For x g L, we have (ca{x) \Tn \ci(x)) > §. 
For x ^ L, we have (ca{x)\T^ \cj(x)) < ^. 



Since we multiply a matrix with two vectors from each side, we silently assume their 
dimensions are compatible. If we used the property PI instead of P2 and some polynomial 
t(n), this definition would be equivalent to the definition of BQP (see dcf. 15. ll ). The reason 
is that if we have polynomial size circuit, we can put a polynomial amount of its identical 
copies, one after another, and we still have a polynomial size circuit. Thus, any polynomial 
power in the definition would not change anything. So we can roughly say that we took the 
definition of BQP and exchanged the two properties PI and P2. 

A circuit with property P2 is much different from a circuit with the property PI. We 
generally do not know how to efficiently implement a given circuit with property P2, which 
means finding a polynomial size circuit being equivalent to it. On the other side if we want 
to know a specific entry of the matrix representing that circuit, this can be answered in a 
polynomial time with respect to the length of the input. It is maybe a bit reminiscent of 
a black box approach: we do not know what a circuit composed of elementary gates would 
look like, but we can use a function to compute any entry of the building block. 

In practice, there is often the situation that we do not know what particular output we 
are looking for, but have some property in mind we would like the outputs to have. This 
situation is expressed in the following definition. We have some function a(x, y) which decides 
for a given x at the input, if y is one of the "good" outputs we are looking for or not. Then 
we want that for "good" xs, e.g. for the xs that are in the language we are interested in, that 
the probability that we obtain one of such be high, but not for the bad xs. 



Definition 5.6 (MQ 1 ^). Let us have an arbitrary function t(n) on integers. A language L 
is in MQ^ 71 ' iff there is a unitary series Ti,T2, . . . having property P2 and functions C[{x) 
and a(x,y) computable in polynomial time such that for each x and its length n: 



In the following text, we will omit the subscripts n on the matrix when it will be clear 
about which n we are talking. 

In the class BQP, it does not matter whether we have one or more accepting configuration 
(see Lemma 15.111 . Here, we can not use the same trick, since generally we can not put there 
an inverse of the matrix since we want that there is only one matrix repeating t(n) times. 
Furthermore, having a function that computes the entries does not immediately imply we can 
a have a function computing the entries of the inverse. Therefore the hierarchies of BQ 1 ^ 
and MQ*(") will generally be distinct. 

Sometimes, we will let the classes MQ and BQ to access oracles and thus define a rela- 
tivized versions of these classes. 

Definition 5.7 (BQ relativized). Let us have an arbitrary function t(n) on integers. A 
language L is in BQ*^ ° for O being an arbitrary oracle over the binary alphabet, iff there 
is a unitary series T\, T2, . . . having property P2 and functions c/(x) and ca(x) computable 
in polynomial time such that for each x and its length n the following holds: 



5.2.2 The classes MQ 



it(n) 



For x ^ L, we have ^2 



For i£L, we have ^2 



y:a(x,y) 



y:a(x,y) 



1 (y\T*^\ Cl (x)) >§. 
x {y\T*W\ Cl {x)) 2 < |. 



5.3 Properties of the new classes 
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For x £ L, we have 



[c A {x)\Tl {n) \ Cl {x)) 



For x Lf we have (cA(x)\Tn |c/(x)) 



< 



Here the entries of the matrices T may be dependent on a polynomial amount of answers of 
the oracle O, but stay unitary regardless of the answers. 

Definition 5.8 (MQ relativized). Let us have an arbitrary function t{n) on integers. A 
language L is in MQ 1 ^ for being an arbitrary oracle over the binary alphabet, iff there 
is a unitary series Ti, T2, . . . having property P2 and functions c/(x) and a(x, y) computable 
in polynomial time for each x and its length n: 



For x £ L, we have 53j/:a(a,3/)=l 



For x<£L, we have J2 y:a ( x ,y)=i (y\ T n (n) \ci{x)) 



2 



2 



Here the entries of the matrices T may be dependent on a polynomial amount of answers of 
the oracle O, but stay unitary regardless of the answers. 

5.3 Properties of the new classes 

In this section, we will show some properties of the newly defined classes. In the first subsec- 
tion, there will be self standing lemmas and observations meant to illustrate the meaning of 
the definitions and the properties PI and P2. In the second subsection, we will state lemmas 
and theorems which will be used to build a hierarchy of the complexity classes. 

5.3.1 Basic properties 

Observation 5.4. For all t(n) it holds BQ t{ ^ C MQ t( - n \ 

Proof. From the definitions. If we set a(x,y) = 1 ca(x) = y, we obtain a special case 

of MQ*(") which equals BQ t(n \ □ 

Observation 5.5. BQ 1 = P. 

Proof. We want to show that BQ 1 C P. For each language L £ BQ 1 we have the following: 

For x i L, we have |(c/(x)|T|c A (x))| 2 > §. 

For x i L, we have \{ Cl {x)\T\c A {x))\ 2 < |. 

Here T is a matrix having property P2 and cj and ca are functions in P. This implies that 
f(x) := |(c/(a;)|T|c y i(a;))| 2 is also in P and this implies that L £ P. □ 

Observation 5.6. (PI -> P2) -> (BQP C P). 

Proof. Using PI — > P2 we get any circuit having property PI, which is the case of BQP, 
can be realized by a matrix having property P2, thus BQP C BQ 1 = P. □ 

Observation 5.7. (P2 -> PI) -> (BQ^ C BQP). 

Proof. Immediate, see Definition 15.51 □ 
Observation 5.8. BQP C BQ polv{n ^ 

Proof. Each problem in BQP has a quantum Turing machine solving it. And each quan- 
tum Turing machine has a series of transition matrices that has property P2 and runs in a 
polynomial time. □ 
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As we were able to amplify the accepting probabilities in the case of BQP (Lemma 15 .30 . 
we would like to be able to amplify the probabilities in case of BQ*M and MQ t{n \ This 
would justify their definitions of them and show their robustness. However, the tricks we 
could do with matrices with property PI can not be straightforwardly applied to matrices 
with property P2. For example, generally we can not compute an inverse of an exponentially 
large matrix in polynomial time. Therefore the question of whether it is possible to amplify 
the class BQ 1 ^ (for nontrivial cases t(n) ^ 1) is left open here. On the other side, the class 
MQ 1 ^ can be easily amplified because it allows for multiple configuration so we can use 
exactly the same trick as in Lemma j5.ll 

Lemma 5.9 (Amplification of MQ). The probabilities | and ^ in the definition of MQ*^ 
may be equivalently changed to any a and b such that a > b + e for some e > 0. 

Proof. Let L be a language in MQ*''™' with the definition changed to a and b satisfying the 
assumption. Let M be the matrix from definition 15.61 showing that L £ MQ*'"' with the 
altered probabilities. We will chose an even polynomial k(n) and make a tensor product of 
k(n) copies of matrices M. As in Lemma [5.31 we will accept iff at least 9 ^ of the copies 
accepts. The new function a'(x, y\yi ■ ■ ■ VMn)) wm check that for at least of the ys there 
is a(x, y) = 1. We have 

k 

T k = 6dM 



and 



a'{x,yiy 2 ■ . .J/fe( n )) 



def j 1 if for at least of the ys there is a(x, y) = 1 



| otherwise 



We define Zi as in Lemma 15.31 and proceed analogously. Then we have 
for 



E 

y:a'(x,y 1 y 2 ...y k („)) = l 



(y\T 



t(n) 



c ,(*))| 2 = p(e^>^) 



> 1-e 



for x ^ L: 



E 



{y\T^\cj{x)) 



y.a l (x,y 1 y 2 ...yk(n)) = 




< e 



To finish the proof, it suffices to notice that the gap between the completeness and soundness 
gets exponentially bigger with increasing k(n), so we can reach any other desired a' and b' 
such that a' > b' + e for some e > 0. □ 



5.3.2 Building the hierarchy 

In this subsection, we will state theorems and lemmas needed to build the hierarchy of the 
classes BQ f ^ and MQ f( - n \ 

Except for the class BQ 1 , which was shown to be equal to P in observation 15 . 3 . ll MQ 1 
will be the lowest class in the hierarchy, because we will show later on that MQ 1 C BQ 2 . 
Still, it is already superior to BPP, the class that is believed to encompass all efficiently 
solvable problems. 

Observation 5.10. MQ 1 D BPP. 



5.3 Properties of the new classes 
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Proof. Let us have a language L in BPP, a polynomial p(n), and a probabilistic Turing 
machine M with running time p(n) accepting L, proving that L £ BPP. Let r(n) be the 
amount of random bits used for inputs of length n. We know that r(n) < p(n). Let M(x, r) 
denote the output (either 1 or 0) of M over input x and random bits r. Here 1 stays for 
accepting configuration and for rejecting. Then the probability of acceptation over x reads: 

P ^ = ^M) E M ^y) 

y.\y\=r(\x\) 

We can straightforwardly realize this probability of acceptation by running the machine M 
on all possible y's of length r(\x\) with equal amplitude, and then sum up the results with 
help of accepting function a(x, (y, z)), that will accept iff M(x, z) will accept. For obtaining 
equal amplitudes, we will use the Hadamard gate on each qubit. 

The probability of acceptation, P(x), will be thus obtained defining 

Cl (x)^\x,0 r ^), 
a(x, (y,z)) °= F M(x,z), 
(i\T\j) °= 1 



where ik is the fc-th bit of i, because we have that 



E 

(y,z):a(x,(y,z))=l 



l,z\T\ Cl (x))\ = £ \(y,z\T\x,0 r ^) 

(y,z):M(x,z)=l 



E E 

V z:M(x,z) = \ 



1 



V2R2KNT 



E E 2\*\t^ E T {\x\) p ( a; ) 

y z:M(x,z) = l z:M(x,z) = l 

□ 

Later on, with the help of GapP functions, we will state stronger theorems about the 
classes MQ p ^ n > for a polynomial p(n), which will immediately lead to the following observa- 
tion as a corollary. We still mention this observation here because it provides a good insight 
into the classes. 

Observation 5.11. For each polynomial p(n) , the class MQ p( - n ^ C PSPACE. 

Proof. It suffices to show that for any functions cj{x) and a(x,y) computable in polytimc 
and a matrix series T having property P2, the logical function 



F(x,n)=\ J2 \(y\T p(n) \ci(x))\ 2 > 

\y:a(x,y) = l 



defines a language L = {(x, n)\F(x, n) = 1} that is in PSPACE. We have 

(y\T^\cj(x)) = (y|T| il )(z 1 |TK 2 )...^ ( „ ) „ 1 |TK p( „ ) )( ip( „ ) |T| C/ (x)) 

Each element of the sum is a function computable in polytime with respect to n, thus comput- 
ing it can not require more than a polynomial amount of space. What we need to remember 
when computing the sum is: The actual sum and indices ■ .i p ( n )- This requires only 

size polynomial with respect to n. To compute the sum J2 y . a ( x y )=i\{y\T plyn ^\ci{x))\ 2 , it 
suffices to remember the actual index y. □ 
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The following lemma is a handy tool telling us that we do not have to bother with 
hcrmitian conjugates of the matrices in the definition of the class MQ 1 , since all of them can 
be defined as hcrmitian. 

Lemma 5.12. Without loss of generality, me may require that the matrix T in the definition 
of BQ 1 and MQ 1 be hermitian. 

Proof. Let us have a matrix T and functions ci{x) and ca(x). We then define a new matrix 
S as 

(y, 0\S\x, 0) D =(y, 1|%,1)=0 
(y,0\S\x,l) D ^ F (y\T\x) 
(y,l\S\x,0) D ^ F (y\T^\x) 



The new matrix is illustrated in Figure 15^1 a). Such a matrix is clearly hermitian. It is also 
unitary, since we have 



S 



T 
Tt 



) 



ss^-s 2 - ( TTt 1 - r 1 1 

Now, we are ready to define 

c'j(x) °= \ci(x), 1) 

c' A ( x ) °= \c A (x),0) 

a'(x, (yJ)) °= a(x,y)S i0 



Then we have (c' A (x)\S\c r ]: (x)) = (ca{x)\T\cj(x)) and thus 

\{cU(x)\S\c'Ax))\ 2 = \{c A (x)\T\ Cl (x))\ 2 

and 

£ IMS|c'z(z)>l 2 = E \(v\ T Mx))\ 3 

(y,i):a'(x,(y,i)) = l y.a(x,y)=0 

□ 

In the definition of the classes BQ 1 ^ and MQ^ n \ the function t(n) reflects the power of 
the class and complexity of the problems in it. The higher the value t(n), the more work we 
have to do to obtain the result. One would thus expect that for each integer i, BQ 1 C BQ l+1 
and MQ 1 C MQ l+1 . However, it is not obvious why this should hold. If we for example 
imagine a language L in MQ 7 , it is not clear how to accept it in MQ S , because if we take 
a matrix M that serves to prove L 6 MQ 7 , once we compute its 8th power, its entries will 
typically be very different and useless. What would save us is if we could construct another 
matrix from the matrix M such that in its 8th power, there are somewhere the entries of the 
7th power of M , or their simple transformations, like multiples etc. Here in the two following 
theorems we construct such composite matrices to show MQ 1 C MQ k for any k > 1. Other 
such relations, for example whether also MQ 2 C MQ for any k > 2 are left open here. 

Theorem 5.13. The following holds: 

MQ 1 C MQ 1+4k 

MQ 1 C MQ 2+4k 

MQ 1 C MQ 3+4k 



for any nonnegative integer k. 
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x,0 


x,1 




r 




y,o 





T 


y,i 


T + 




J 



a) 



Figure 5.2: The new matrices for 





x,0 


x,1 




r 







wT 


/2 1 


y.1 


I* 1 


J 



b) 



proofs of lemmas \5.12\ and \5.13l 



Proof. Let us have a language T G MQ 1 , a matrix T and functions c/(x) and a(x, y) solving 
it. We know from Lemma 15 . 1 21 that we may assume the matrix T is hermitian. That means 
we have T = and TTt = T 2 = 1. Then, let us define a matrix S as 

(y,0\S\x,0) D ^ (y,l\S\x,l) = -±=(y\T\x) 



V2 



(y,0\S\x,l) °= -j=S xy 
(//• i|- s 'k- (l ) °= F 



The new matrix is illustrated in Figure [^21 b). Such a matrix is unitary, because we have 







1 






T 






-1 






Tt 


i( 


TT t . 


f 1 




- T 



V2 



ssi--\ " 1 * ~ T + Tt 1 - if 2 ° 1 -1 

-T+ + T (-l)(-l)+TTt J 2 I 2 J 
Now, we are ready to define Cj(a;) D = F \ci(x), 1) and a'(x, (y,i)) = §ioa(x,y). Then, as 



S 1 



s 2 



T 2 - 1 T + T 
-T-T -1 + T 2 



) = (-£ 



we get 

E I^.is^IcjCx), i; 

(j/,i):a'(x,(j/,i)) = l 

showing MQ 1 C MQ 2 . 

Alternatively, when we define c'^x) D = F |c/(x), 0) and a!(x, (y, i)) D = r Sioa(x, y), then since 



E |<fl r M*)> 

y:a(x, j/)=l 



S 3 = ^f 

y/2 



we get 



C J 1 1 w T i 1 r — T 1 ^ 
I 1 T J I -T J 7f L 1 -T i 

E (tf.il^Mx),!)^ J E |<w|r|cz(a:)> 



(y,i):a'(a,(j/,i))=l 



S/:a(x,j/)=l 



So, instead of accepting probabilities | and |, we would obtain ^ and ^ respectively. However, 
this can be cured by lemma IFHI to finally show that MQ 1 C MQ 3 . 
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It also holds that S 4 = —1, since we have that 



S 4 = 



( T } 2 _ f -1 O^j 
{ -T J I -1 J 



Thus, adding a power of 4 will only change the global phase of the resulting matrix. That 
means we may add a power of any multiple of 4 to the equations 15.3.21 and 15.3.21 without 
changing the right hand side. This proves the rest. □ 

Theorem 5.14. The following holds for any nonnegative integer k: 

MQ 1 C MQ 4k 



Proof. We will use exactly the same technique as in the theorem 15.131 only the matrix will 
be a little bit more complicated. Let us have a language L £ MQ 1 , a matrix T and functions 
Ci(x) and a(x, y) solving it. We know from Lemma 15 . 1 21 that we may assume that the matrix 
T is hermitian. We define a matrix R as 



1 



S H 



R V2 ( V s 



where S is the matrix from the proof of Lemma |5 . 1 31 and H is the Hadamard matrix. The 
matrix R is unitary, because we have 



V2^ H -S 1 



2 SH- HS T 



-SH + HS 1 



and 



HST ~ 2 ( 1 -1 ) ( ^ t) 



( 



T+ 1 T - 1 
T-l -1-T 



) 

2 ( -1 T ) [ 1 -1 ) = SH 



It can be verified that 



R 4 





( 


-i 


T 


1 


-1 


\ 


1 




-T 


-1 


-1 


-1 


2 




-1 


1 


-1 


T 






K 


1 


1 


-T 


-1 


/ 



Now, we are ready to define c'j(x) — \ci(x),l) and a'(x,(y 7 ij) — Sioa(x,y), so that 
means we will look onto the second column and the first row. We get 



E 

(y,i):a'(x,(y ,%))=! 



(y,i\R 4 \ Cl (x),l) 



- Yv\ T \ci{x)) 



y.a{x,y) = l 



the probabilities | and ^ are changed by a factor of j, but we may still get them back using 
Lemma f5. 91 □ 
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Corollary 5.15. For any integer k > 1, MQ 1 C MQ k . 
Proof. Follows from Theorems 15 . 1 31 and 15 . 141 



□ 



In the case of BQP, we could equivalently allow for multiple accepting configurations. On 
the contrary, here its not clear whether BQ 1 ^ — MQ 1 ^ for some t(n), not even whether 
MQ'^ C BQ l for some t'(n). The only such question answered here is the following: 

Theorem 5.16. MQ 1 C BQ 2 . 

Proof. Let T, cj and a(x,y) be the items as in the definition 15.61 that give the evidence for 
L E AIQ 1 . We will use the amplification in Lemma f5. 91 and assume we have 



For x € L; Y.y. a{x , y)=1 \(v\T^\ci{x))\ 2 > §. 

For x ^ L: E H : a (»,»)=l l<2/|^™)| C/ (x))| 2 < J. 
Furthermore, we will assume that for each we have 

(i\T\j) = (j\T*\i). 

Such an assumption is justified by lemma 15.121 because we may assume that the matrix T is 
hcrmitian and have 

(i\T\j) = (j\T T \i) = (j\(Tiy\i) = {j\T*\i). 
We will define a matrix M and functions c'j(x), c' A (x) as 

(i,b\M\j,c) ^ -L(-i)«<*(-i)W-°<.*A)(i\T\j) 

|4(s)> D = M^.O) 
|4(z)) =>/(*), 0) 



Such a matrix is unitary, because it is a tensor product of the Hadamard matrix and the 
matrix T (which is unitary by assumption), with some rows multiplied by —1. Then we have 

(c' A (x)\M 2 \c'j(x)) = Y l (ci(x),0\M\i,b)(i,b\M\c I (x),0) = 

i,b 

£^(-l)°*A(-l) 0(1 -^ 

^J2(-lf^-^\{i\T\ Cl (x))\ 2 = 
i,b 

^(E(- 1 ) 0(1_a(3; ' l)) |< i i T i c H^>| 2 +E(- 1 ) 1(1 ^ ( ^ )) |( j i T i c ^ a; )) 

i i 

l(i+ Yl \mci(x))\ 2 - E |<*i t m*)}| 



i:a(x,i) — l 



i:a(x,i)—0 



1(2 E I^m*)) 



Then for i e L, we have that 



(c^M^)) 



i:a(x,t)=l 



E KvTOf 

j/:a(x,s/)=l 



E |<<i r i c 'W> 

i:o(x,i)=l 



9/ - 3 
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and for x (fc L we have that 



(^mc'Ax)^ 2 = \ J2 Nn*>i 2 | 2 <Q) 



2 



< - 

~ 3 



y:a(x,y)=l 

□ 

5.4 Putting some problems into the classes 

Now we will accommodate some well known problems in the new classes, to illustrate their 
power. Since we will also ask what functions the classes can compute, we will first define 
what we mean by that: 

Definition 5.9. A function f(x) is computed by a class C iff for each index i we have 
L °= {< x,i> \ f(x)i = 1} G C where f(x)i denotes the i—th bit of f(x). 1 

Definition 5.10 (NP-Search problems). A NP-Search problem is the following: For a 
given function f(x,y) with the range of {0,1} computable in time p(n) — poly(n) and for a 
given x, output g{x) such that 

(g(x) = y & f{x, y) = 1) V (q(x) = & Vy = {0, 1}*<»> : f(x, y) = 0) 

As was shown in 0], the class BQP can only lead to a polynomial speedup over the 
class BPP when solving ./VP-Search problems. The following theorem shows that the classes 
MQpW for some polynomial p and even their union MQ poly ^ are no better in this aspect. 

Theorem 5.17. MQ poly ^ does not solve NP-Search problems given as oracle. More pre- 
cisely: 

For each NP-search problem f(x, y) computable in a polynomial time, let us define an oracle 

O f D ^{(x,y)\f(x,y) = l} 
Then for each polynomial p(n) there exists an Of such that 

Li = {<x,i> \g{x)i = 1} i MQ p ^° f 

Proof. We will proceed using the polynomial method in a similar way as it was applied in 
From the definition of MQ P ^ , we know that there exists a polynomial r{n), such that 
(y|T|a;) is a function of x and y computable in time r(n). Thus, it can ask the oracle at 
most r(n) times. As in we may express each element (y|T|x) as a polynomial of variables 
indicating presence of pairs in the oracle of degree at most r(n). It follows that (y\T p ^\x) 
is for each x and y a polynomial of degree at most p(n)r(n) and its square of degree at most 
2 (p(n)r(n)). It follows that 

\(y\T p ^\ Cl (x)) 2 



E 

y.a(x,y) = l 



is expressible as a polynomial of degree at most 2 (p(n)r(n)). However, the or function of 
n variables is a polynomial of degree n. Since we have exponential amount of pairs, we can 
not realize their or function with a polynomial of a degree 2p(n)r(n), which is still only a 
polynomial. □ 

Corollary 5.18. There exists oracle O such that BQ 2 ° ^ NP° . 

1 As with conventional quantum circuits, we will not prove directly that the language 
L = {< x,i > \f(x)i = 1} is in a class C, but only that the language Li = {x\f(x)i = 1} is in C for 
all i and furthermore the algorithm that decides a language Li is for all i the same and only takes i as 
parameter. 
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a) 

10) 
10) 
I0> 

I0> 
I0> 
I0> 




(-1) 




b) 

I0> 
I0> 
I0> 
I0> 
I0> 
I0> 



(-1) 



H 



(-1) 



H' 



Figure 5.3: The trick used to St the Deutsch-Jozsa algorithm into the BQ 2 class. In a), there is the 
original setup, in b) there is the one we use, which gives the same result up to a global phase. 



Except for the trivial class BQ 1 , which equals to P fObservation l5.3.T|l . the lowest class in 
the hierarchy is the class MQ 1 , because we have MQ 1 C BQ 2 in Theorem 15. 161 The second 
lowest class is then obviously BQ 2 . Yet it captures the famous Deutsch-Jozsa problem. For 
this problem, see [Jj]. 

Theorem 5.19. The class BQ 2 solves the Deutsch-Jozsa problem. More precisely, given a 
function f(x) — > {0, 1} that is either constant or balanced 2 and an oracle 



the language 



L 



O f = {x\f(x) = 1} 



{0 n |n arbitrary integer } if the function f is constant 



{} 



if the function f is balanced 



is in BQ 2Qf . 

Proof. We will realize the circuit in the Deutsch-Jozsza algorithm (see Figure l5~3l a.)) by two 
equal matrices T having the property PI (see Figure ED D )). The matrix T will be a product 
of a matrix realizing x — » (— l)^ x \ as in the original version of the algorithm, and H n . We 
may add another matrix for the / function to the front, since it will only add the number 
(_l)/(o) t the global phase and thus will not change the result. Formally, we define a matrix 
T as 



(y\T\x) °= {-l) f W{y\H n \x) = -^(-lj/Wf-ijEi^w mod 2 

v 2" 

which is obviously computable in polytime and unitary. We define Ci(x) = 0" and ca(x) = r 
for x of length n. Then we have 

(0"|T 2 |0")| 2 = |^( "|T|fe){A;|T|0 n ) 2 - 

k 

EC l)f(k) 1 f i\T. f kiO n j mod 2 1 / l)f(0 n )( l)T,,0 n ik, mod2| 2 = 



|Ei(- 1 ) /(fc) (- 1 ) as:,k 



i0"i mod 2 



2 -i|E(-D /(fe)2 



2 With balanced function we mean a function that outputs 1 on exactly half of the inputs and zero on the 
other half. 
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a) 





b) 

10) 
10) 
10) 
10) 
10) 

io> 





Figure 5.4: The trick used to fit the Shor algorithm into the MQ 2 class. In a), there is the original 
setup, in b) there is the one we use, which gives the same result up to a global phase. 



If the function is constant, then the sum Ylk(~ 1)^ ^ equals ±2™ and the probability 

2 

equals 1. If the function is balanced, both the sum and the probability is 



(o n |T 2 |o n ; 

0. 



□ 



Corollary 5.20. There exists an oracle O such that BQ 2 ° NP. 



Perhaps not surprisingly, to realize the famous Shor algorithm |14| . we also need only to 
square a matrix, this time however we need to have multiple accepting configurations. 

Theorem 5.21. The class MQ 2 solves the factoring problem. More precisely, there exists a 
constant k such that given numbers x and N > k as in the Shor algorithm, the language 

L = {< N,i > \x a mod N has a period r whose i-th bit is 1} 

is in MQ 2 . 

Proof. We will use the same notation as in the original paper by Shor |14j . N is the number 
to factorize and N = P1P2 where both pi p2 are primes and are different from each other. 
We then arbitrarily choose an x coprime to N . The pair (cc, N) is the input of the algorithm. 
The goal is to find the smallest r / such that x r mod N = 1. This r is called a period of 
x. We choose a number q such that q is a power of 2 and q > 2 2 ^ l ° 92N ^ . This number will, 
together with the length of x and N, determine the size of the matrix. 

As in the previous lemma, we will convert the circuit into the desired form by adding 
another unit to the front which will only affect the global phase. See Figure IQ1 

The space we will work with is spanned by vectors of the form \x, N, a, i), where the third 
and the fourth part of the vector are arbitrary integers from the interval [0, q— 1]. From now 
on, if we do not specify the sums bounds, we mean summing over the range of the variable. 
We first define two matrices called DFT and MOD in the following manner: 



1 2ix / 

(x',N',a',i'\DFT\x,N,a,i) °= —S x . xl 6 N . N ,dia>e~ aa 
(x , N', a ', i'\M OD\x, N, a, i) D = r — 8 XtX >8N,N>8 a ,a'Si'+i,x a mod n 



Now, we may simply check that both the matrices are unitary: 
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(x 1 ,N 1 ,a 1 ,i 1 \DFT\x',N',a',i')(x 2 ,N 2 ,a 2 ,i 2 \DFT\x',N',a',i') = 

x',N',a',i' 



- 2^71 / 

^ 3x 2 ,x'0N2,N'0i 2 ,i ,e q 

x',N',a',i' ^ 

]T -6x u x 2 6 NuN J il , i ^ ai - a > )a 'Sx 1 ,x 2 5 NuN2 6 il , i2 ]T e^ (ai - a2)a ' - 

a' ^ a' 

±5x 1 ,x 2 6 Nli N 2 6i 1 ,i 2 e 4 L ( (a 1 1 _„ 2 2 ) )'~ i 1 = iffai^a 2 
= 1 otherwise 



. 9 



<5xi , X2<5tVi ,N 2 <^ii ,i 2 <^ai ,o 2 



x'N'a'i 



Y (xi, N lt ai ,h\MOD\x', N', a', i')(x 2 ,N 2 , a 2 , i 2 \MOD\x', N', a', i') 

v 

Y - S x u x'SN u N'Sa u a'S il+il xa ' mod N S X2iX > £jV 2 ,iV' S a2 , a ' 5 V _ i+2;X a' 
!V',a',i' 9 

^ ' — ^Xi,x 2 ^JVi, N 2 $a 1 ,a 2 fii 1 +i' ,x a i mod N Si' +i 2 .x a i modJf 

4> Q 



Now we are ready to define a unitary matrix T = DFT ■ MOD. Its elements read 



(x',N',a',i'\T\x,N, a,i) — 5 x , x >5 N . N >e 1 a a 8 l+l > ^ mod n 



and are clearly functions computable in polytime. 
The function cj will be defined as 

d{x,N) D = F \x,N,0,0) 
Now, we will compute the probability of outputting configuration \x, N,n^,i) for any 
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integer < n < r — 1 and any integer < i < q — 1. We have 



(x' ,N' ,a' ,i'):a'—nq/r 



{x',N',a',i'\T J \c I (x,N)) 



I J] (x'^'^'^'lrla;',^'^"^")^',^'^"^"^^,^^^) 

{x f >N' ,a' ,i f )\ a" ,i" 
a' —nq/r 

| XI (aJ,JV,o' I * / |T|aj,JV,o w > *' / )<a: > JV > a" > i // |T|aj,JV,0 J Q) 

[a' ,i'):a'—nq/r a" ,i" 

2 



E l~E-°' a ^ 



(a' ,i'):a' —nq jr a" ,i" 



i' ,x a mod AT 



E ^2 | E 

(a' ,i'):a'=nq/r a":x a " mod JV=i'+l 

r-1 x Lf|-1 

E E ^2 E 

:a l =nq/r fc— 0:a"— aJ 

E E ilE 



e ^ 



a' :a' —nq/r k—0:a' / —a'J-\-kr 
r-l , LfJ-1 



L a kr 



i'—O a'-.a'—nq/r k—Q 
r-l r-l L"fJ — ! 



E E a 2 | E 



e « 



■ (nq/r)kr 



n=0 i'=0 k 
r—1 r—1 _^ L^J — 1 2 r — 1 r — 1 ^ 

EE^I E I — EE ^2 l~j 



n=0 i'=0 



2r r 2 



2r 



27V 



LrJ 2 > -(r-l) =!-- + -> i--> 1-^2= i-- 



g 2 r (j 2 r 



9 q 



N 



We see that the output from the algorithm, with a high probability, will be some multiple 
of -. Using the chained fractions method, we know that we can determine the r from this 
number with a quite good accuracy, see again ^1]. As in the normal Shor algorithm, this 
must be done by some classical postprocessing. The postprocessing will be contained in our 
accepting function a((x, N),y), which will then check that the i-th bit is 1. To feed the 
function with more multiples of £ at the same time, we may simply do a tensor product of 
any polynomial amount of our matrices T and redefine the functions cj and a in an analogous 
way. We know that there exists a polynomial p (length (AT)) such that if we feed the function 
with p( length (AT)) values, then 



P(we obtain correct r | given p(length(iV)) of multiples of q/r) > - 
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That means that 

P(we obtain correct r) = 

P(we obtain correct r | given p(length(iV)) of multiples of q/r) 
P(given p(length(iV)) of multiples of q/r) + 
P(we obtain correct r | given something else) P (given something else) 

> P(wc obtain correct r | given p (length (N)) of multiples of q/r) ■ (1 - _)J>(length(iV)) 

2 

+ P(we obtain correct r | given something else) • (_)P( lcn s th ( Ar )) > 



- 4 v 7V ; 



> ^(l-Klength(iV))-i) 



which is greater than § for N > |P(length(iV)). There are only constantly many N for 
which this inequality is not satisfied. □ 



5.5 The class M 2 

In this section, we will loosen Definition 15 . 51 even more and define yet another new complexity 
class, which will however turn out to be equal to another already a known class. This will 
close from above the hierarchy presented here. 

Definition 5.11 (M'(">). A language L is in M'*"' iff there is a series Ti,T2,... (which 
does not have to be unitary) having property P2 and functions ci(x) and ca{x) computable 
in polynomial time, such that: 

Forx£ L, we have 1 > |(cA(a;)|r i(rl) |c/(a;)) | 2 > §. 
Forxi L, we have < \{c A {x)\T t ^\c I {x))\ 2 < |. 

Lemma 5.22. Let us have a language L £ M 2n and a matrix T and functions cj(x) 
and ca{x) solving it. Then there exists a matrix S whose entries are computable in poly- 
time and functions c'j(x) and c' A (x) computable in polytime such that {ca{x)\T 2ti \ci(x)) — 
(c^lSV/M). 

Proof. We extend the matrix T adding one line to the top and one line to the left, filling 
them with zeros except the top left corner, where 1 will sit. Formally, we will have a new 
matrix T" such that 

(0\T'\x) = F (x|T|0) D = S x0 
(x\T'\y) D ^ F (x~l\T\y-l) 

Let the size of the matrix T" be N. The size of the new matrix S will be N(2n — 1) Now we 
are ready to define the matrix S as 

(a\S\b) °= F (ai\T'\a 2 )(as\T'\a4) . . . (a in - 3 \T'\a4 n - 2 ) 

(a 4 „-l|r'|6 1 )(6 2 |T'|6 3 ) ■ ■ • (&4n- 2 |T'|&4n-l) 

and functions c' A (x) D = F |0, . . . , 0, ca(%) + 1) and c'j(x) °= F \ci(x) + 1, 0, . . . , 0) to get 
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(c' A (x)\S 2 \c'j(x)) = (0, . . . , 0, c A (x) + l|S 2 | C/ (x) + 1,0, ... ,0) 

= ^(0, . . . , 0, c A {x) + l\S\^(i\S\ Cl (x) + 1, 0, . . . , 0) 

i 

= £)<0|T'|0> n - 1 <CA(aO + l|T'|i 1 )<i 2 |T'|i 3 )(i4|T , |i5) • ■ • (i2n-2\T'\i 2n -i) 

i 

(k\T'\i 2 )(i 3 \T'\u) . . . (i an _ 3 |T'|i ana )<i 4n _ 1 |r , |c J (x) + IXOIT'10) 2 "- 1 
= (0|T'|0) 2 - 2 "£(c A {x) + l\r\i 1 )(i 1 \T'\i 2 ){ia\r\i a ) . . . 

i 

. . . (i 2 n-2|7 1 '|i2, l -i)(i 2 » l -i|T'|c/(a;) + 1) 
= (0|T'|0) 2 "- 2 <c A (z) + l\T' 2n \ Cl (x) + 1) = (c A (x)\T 2n \ Cl (x)) 

□ 

Corollary 5.23. M p = M 2 for any polynomial p. 

Proof. For even values of the polynomial p, Lemma 15.221 can be used. For odd values the 
following trick can be used: Having a matrix M that serves to show that a language L is in 
M p , we define for each integer k a new matrix 

s(k) °zr f Me ^ _ to J ] 

Then it can be verified that for any integer k > 1 we have 

S y f j 

Therefore, using the matrix S instead of M, we can add one to the power and thus arrive 
at an even number in the exponent. We can still find the desired entries in the top right 
corner. □ 

Corollary 5.24. BQ p 2 oly C M 2 . 

Proof. Follows directly from definitions 15. 515. Ill and the previous corollary. □ 

Lemma 5.25 (Amplification for the M 2 class). For each L € M 2 and each polynomial 
q(n), it holds that L £ M 2 with completeness 1 — e~ 9( ") and soundness e~ q ( n \ 

Proof. The proof method from Lemma 15.91 may be applied analogously here. □ 

Lemma 5.26 (Multiple accepting configurations for A/ 2 ). Let us have a language L 
such that there exists a matrix T and functions ci(x) and a(x,y) computable in polynomial 
time such that: 

Forxe L, we have 1 > |£, :a( ^ )=1 (y|T'(") | Cj (x)) \ 2 > §. 

Forxi L, we have < |E„ : „(,,„)=i(j/|r t(n) \ci(x)) \ 2 < |. 
Then L E M 2 . 

Proof. We need two functions c'j(x) and c' A (x) computable in polytime, a polynomial q(n) 
and a matrix M such that 
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For x £ L, we have 1 > 



(c'^lM^lc^)) 



> 



For x i L, we have < (c' A {x)\M^\c'j{x)) 



< 



Let T, ci and a(x, y) be the items as in Definition !.^. 61 that give the evidence for L £ MQ 
We will use the amplification in lemma 15 .251 and assume we have 



p(n) 



> 



For a; 



E vMx , v) =i (y\T^\ Cl (x)) 



To capture multiple accepting configurations into the notion of the class M 2 , we will use 
the following trick: We will obtain the sum 



J2 kwlT^Mx)) 



y:a(x,y) = l 



in a matrix element of M p+1 . The matrix M will have three indices, x,x' and y. The third 
index will be used to count the number of elements. In the expression for M p+ , we will 
want to have p times pairs of elements of T and one time a pair of indicators whether a 
corresponding pair of elements of T is accepting or not. Therefore, in the space of the third 
index, the matrix will have nonzero elements only on the main diagonal shifted up by 1, 
which will cause that in the expression of M p+1 , each element of M will have the index of 
the following one plus 1. Thus, if we start with 1, the first element will have index p + 1. 
This element will be 1 if corresponding element in T has the function a equal to 1, or zero 
otherwise. 

Formally, we define a matrix M and functions c^(x), c' A (x) as 



SvuV3+l( X A T \ x *)Wl\ T *W2) '^V2^P 



(x 1 ,x l 1 ,y 1 \M\x2,x' 2 ,y 2 ) = { 8 yuy2+l S x ^ 

, 

c 'i( x ) D = F \ci(x),ci(x), 1) 

4(x) D ^ F |o,o,p + i) 



if a(x,X2) — 1 
and 1/2 —P 

otherwise 
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Then we have 



(c A (x)\MP +1 \ Cl (x)) ^ (0 : 0,p+l\MP +1 \c I (x),c I (x),l) = 

E(°' 0,p+ l|M|ii, ji, k 1 )(i 1 ,j 1 , kx\M\i 2 J 2 ,k 2 ) ■ ■ ■ (i P ,j P , k p \M\c I (x),c I (x), 1) 



i,j,k 



)(h,ji,kx\M\i 2 ,j 2i k 2 ) 
• ■ ■ (ip-i,j p -i, kp-i\M\i p ,j p ,2){i p ,jp,2\M\ci(x),c I (x), 1) 

J2{0,0,P+l\M\i 1 ,j 1 ,p){ii,j2,p\M\i 2 ,j 2 ,p-l) . . . 



1,3 



(ip-i,3p-i, 2|M|i p , j p , 2)(i p ,jp, 2\M\c I (x),c I (x), 1) 



E l-(ii\T\i 2 )...(ip^\T\ip)(ip\T\ Cl (x)) 

i:a(x,ii) — l 

E 1 • 0'ilrlia) • • ■ <j P -i|T*|j p ><j p |T*|c 7 (z)> 

J:jl=«l 

E (*i|^|c / ( a; )>(*i|r* p |c7(x))= J] l(H|T p | C/ (x))| 2 



21 :a(x,ii)=l 



ii :a(x,ii) — 1 



We obtained 
For x E L: 



(c A (x)|M" +1 |c 7 (x)) 



For x L: 



[c A {x)\M^\ Cl {x)) 



E |n tp I c '(*)> 

ii:a(a;,ii)=l 



E |<«'ii TP i^W) 

ix :a(a;,ii)=l 



■'5 £ 5 



Due to the assumption, we also have that Y^i ± -a(x h)=i (^\^ P \ c i( x )) 
M shows that L € M p ^ +1 . Using lemma it follows that L E M 

Corollary 5.27. MQf lv C Af 2 

Proof. Follows the definition of 15. 61 15.111 and the previous lemma. 



< 1. So the matrix 



□ 



Lemma 5.28 (Real numbers in M 2 ). Let us have a language L S M 2 . Then there exist a 
matrix M documenting that L S MQ*^ such that there are only real numbers as the entries 
ofM. 

Proof. Let T, cj(x) and ca(x) be the witnesses of L G M 2 . We define 

def / Re —Im 



M 



Im Re 



I4(^)> D = 1^(^,0) 
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where Re D = F ReiT) and Im D = F ImiT). Then we get 
M 2 -- 



Re 2 - Im 2 
ImRe + Relm 



ImRe — Relm 
Re 2 - Im 2 



For each x, y, we may write 



(x\T 2 \y) = (x\ {Re + Urn)) 2 \y) = (x\(Re 2 - Im 2 )\y) + i(x\(ReIm + ImRe)\y) 



and so we have 

2 2 

(x\T 2 \y) = (x\(Re 2 - Im 2 )\y) + (x\(ReIm + ImRe)\y) 

This means that the norm of pairs of entries in the corresponding positions in the submatrices 
in the first column is equal to the norm of the corresponding entry in the original matrix T. 
We get 



E 

{y,y'):a(x,(y,y'))=l 



(y^M^x)) 



c A (x),0\M 2 \ Cl (x),0) + (c A {x),l\M 2 \ Cl {x),0) 



(c A (x)\T 2 \cj(x)) 



The proof is finished using Lemma 15.261 since the construction there does not introduce any 
new complex numbers. □ 

Lemma 5.29 (Precision required in class M 2 ). Let us have a language L £ M 2 . Then 
there exist a matrix M documenting that L € MQ^ such that for some polynomial q(n), 
all elements in the matrix M are exact multiples of ^ . 

Proof. Let T be a witness of L e M 2 and let the size of the matrix T be 2 p (") for p being 
a polynomial. We know from lemma 15 . 281 that we may assume there are only real entries in 
the matrix T. We define M as 



(i\M\j) D =f= F sign«z|T|j» 



1 



2«(") 



2"(")|(i|T|j)| 



Then the biggest error we may get when computing the result over an input is 

2 



{i\T*\j) - (i\M 2 \j) 



< 2 p(r ' 



2<j(r, 



_ 2 l +p( n )-q( n ) 



because when computing an entry of T 2 , we are summing over one index with 2 p (") values 
and the items in the sum are products of 2 items of precision . Setting q(n) > p(n) + 4, 
we get that the error is at most |. Using the amplification lemma 15. 251 we can get back the 
numbers ^ and |. Furthermore, since |(z|M|j)| < |(i|T|j)|, we have that 



(z\M 2 \j) < ^\T 2 \j) 



< 1 



□ 



In the following lemma, we provide a characterization of the class M 2 , which we will use 
later on. 

Lemma 5.30 (M 2 characterization). L E M 2 there exists a function G(x,y) : 

({0, 1}™, {0, l} ra ) — > { — 1, 1} computable in polynomial time and a polynomial p(n) such that 
for each x: 

ifxELthen\j: v G(x,y)\ 2 >2PMl 
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if x i L then\J2 y G(x,y)\ 2 < 2^)1, 
where n = lengthix). Furthermore, it holds that 



\E v G(x,y)\ 2 

2 p(n) 



< 1. 



Proof. Using the setting and notation from the definition of M 2 , let n be the length of cj(x). 
We define the function F(x, k) as 



and write 



F(x,k) = (c A (x)\T\k)(k\T\ Cl (x)} 



(c A (x)\T 2 \ Cl (x))\ 2 = \J2{c A (x)\T\k){k\T\ Cl (x))\* = \J2n*,k)\ 2 < 1 



k=\ 



k=l 



The inequality follows from the definition of the class M 2 (Definition 15.11(1 . The range of 
the functions F(x, k) is the interval [—1, 1]. Using lemma 15.291 we know that we may assume 
there exists a polynomial q(n) such that the functions F(x, k) output multiples of . Now, 
we may define function G(x, (y, z)) : ({0, 1}™, ({0, 1}", {0, l} 9(n) )) {0, 1} as 



Then we have that 



and thus 



1 iiF(x,y)>0k F(x,y)>zfa 
-1 iiF{x 1 y)<0&F{x 1 y)<^ 

otherwise decide with probability \ between —1 and 1 

Y,G{x,{ y ,z))=F{x,y)2« n ) 

Z 

|^^G( a; ,(fc^)) 2 =2 2 «(")|^F( : r,fc) 



fe=l z 

Setting p(n) = 2g(n) proves the lemma. 



fc=i 



□ 



5.6 GapP functions 

In this section, we will present a definition of the so called GapP functions defined in and 
the class AWPP which is defined using these functions. Then we will prove M 2 — AWPP. 

Definition 5.12 (Gap function). A function f is in the class Gap iff there exists a non- 
deterministic Turing machine M running in polynomial time such that f{x) is the number 
of accepting paths of M over x minus the number of rejecting paths of M over x. 

In |13|. the following lemma is shown: 

Lemma 5.31 (Gap functions properties). Let f(x) € Gap and q a polynomial. Then 
the following are functions in Gap: 

1- -f{x)„ 

®- 1Z\y\<q(\x\) f\ X iV)> 

3- Yl v<q{ \ x \)f[x,y)- 

We will use also the following, very handy characterization, proved in 

Lemma 5.32 (AWPP characterization). A language L is in AWPP iff there exists a 
polynomial p and a function g £ Gap such that: 
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For x € L, we have § < g(x)/2 p < 1. 

For x 4_ L, we have < g{x)/2 p < §. 

The authors also showed that the constants ^ and | may be amplified to 2~" and 1 — 2~" 
respectively. 

Lemma 5.33 (Characterization of BPP using GapP). Let us have a language L E 
AWPP and g{x) andp as in Lem,ma \5.S 6 3. showing that L G AWPP , and a nondeterministic 
Turing machine M{x,y) such that its gap is exactly g(x) and the length of y is exactly p. 
Then L G BPP. 

Conversely, for each L G BPP, we can find such g(x) and M(x, y). 

Proof. First, let us have a language L, g(x) and M(x, y) as above and let us denote M(x. y) = 
1 if M over input x with random bits y accepts and M(x, y) = otherwise. Then 

g(*)= E 1 E 1 

y:M(a: lV )=l y.M(x,y)=0 

and according to the assumption about the length of y: 

2 *> = E i+ E i- 

y.M(x,y) = l y.M(x,y)=0 

Thus, solving these two equations, we get that: 

If g(x)/2P > | then P(M accepts x) = £m( ^ )=i1 > \ + | and 

if g(x)/2 p < \ then P(M accepts x) = £M( ; p y)=l1 < \ + ±. 

Using Lemma f5. 31 we get that L G BPP. 

On the other side, if we have L G BPP and a machine M(x, y) with p random bits 
accepting it, then 

For x G L: l>^>§-§ = §, 

for^L: ^<|-| = -i. 

We will therefore add another two bits &i, 62 to the machine M and define: 



1 <=► M(z,y) = IV 61 ^62 
otherwise 



Then P(M accepts x) = 1 - P y (M(x, y) = 0)~ and thus 
F°rzGL:l>#>(l-ii)-(ii)=§, 
For 1 4 L: < 4I 1 < (l - ^ - ^ - 1 



2P — \ 3 2 J \ 3 2 / 3 : 

□ 

Theorem 5.34. AVFPP D M 2 

Proof. Using Lemma f5.30l we know that for each language L G M 2 there exists a function 
G{x, y) computable in polytime and a polynomial p(n) such that for each x: 



iixeL then |£ y G(x, y)\ 2 > 2f(™)|, 
ifx^Lthen|^G(x,y)| 2 <2P(")i, 



48 



New quantum classes 



where n = length(x). Since the range of G is only {1, —1}, we have that G £ GapP. Using 

Lemma 15.311 we have that \ J2 y G(x, y)\ 2 S Gap as well. It remains to prove ^ y ^,^ V ^ < 1, 
which is also thanks to Lemma 15. 301 □ 



f2 



Theorem 5.35. AWPP C M 

Proof. Let us have a language L S AWPP, a function g(x) € GapP, and a polynomial p 
proving that L £ AWPP as in Lemma T5.32I We assume that g and p are amplified to 

For x E L, we have § < g{x)/2 p < 1. 

For x L, we have < g(x)/2 p < |. 

Let us now have a nondeterministic Turing machine M(x,y) for which the function <?(a;) 
computes the gap. We then define a matrix M as 

(z,y|M|z',y'> D = 2' p / 2 5 x ^{-l)^^{-l) M ^ 

Such a matrix is furthermore orthogonal, since its proportional to the Hadamard matrix 
working on the second register with some rows multiplied by -1, which does not change the 
unitarity. But here we will not need this fact. We get 

(x,0\M 2 \x,0) = Y / (x,0\M\i,j)(i,j\M\x,0) = 

J2(^\M\x,j)(x,j\M\x,0) = 
j 

^2 2 _p (-l)^' 0lJl (-l) M(2: ' 0) (-l) S ' Jl ' 0l (-l) M(:cj) 



2 -p(_ 1 )M(*,o) ^(_r 



M(x,j) 



2p 



And thus 



2 



For x e L, we have 1 > |(x, 0|M 2 |x, 0)| 2 = (^Y > (|) 2 > |. 



For x i L, we have < |<sc, 0|M 2 |a;, 0)| 2 = (s^) 



<(l) 2 <i 



This finishes the proof. □ 

The following theorem is a corollary of Observation 15. lUl saving that BPP C MQ 1 and 
Theorerr l5.16l saving MQ 1 C BQ 2 . Nevertheless, a direct proof of BPP C BQ 2 is much 
easier and elegant: 

Theorem 5.36. BPP C BQ 2 . 

Proof. We proceed exactly as in the proof of Theorem 15.351 Using Lemma 15.331 we know 
that there is a machine M(x,y) with exactly p random bits. Thus the matrix M defined as 

(x,y\M\x',y>) ^ ^^(-^^(-l)^) 

is properly normalized, since there are exactly 2 P different y's and so each row and column 
has a unit norm. The matrix is furthermore unitary, since is an entry of a 

Hadamard matrix and the element (— l) M ( a: >W multiplies some columns by —1. Multiplying 
some columns by —1 obviously can not change orthogonality of columns, but it can neither 
change the orthogonality of rows, since always both the corresponding entries in the rows 
have their sign changed and thus in the inner product the sings will cancel. We then conclude 
that the matrix M is unitary. □ 
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PSPACE 




P=BQ 2 



Figure 5.5: Hierarchy of classes including some new ones. For each pair connected by a line, the 
class that stays upper contains the lower one. MQ k>2 denotes any class MQ k for a constant k > 2. 



5.7 Hierarchy with the new classes 

The hierarchy is visualised on figure 
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Conclusions and remarks 



We summarized the concept of Quantum Computing, both from the view of quantum Turing 
machines and the view of Quantum circuits. We also showed the relation between the two 
models and touched the phenomenon of entanglement. Then we reviewed some properties of 
the class BQP, a class of problems that are efficiently solvable by quantum Turing machines 
and quantum circuits. 

In Chapter we define a new bunch of classes inspired by the quantum world and the 
class BQP. Their definitions are natural generalizations or modifications of the definition of 
BQP. One part was obtained by changing the condition that a circuit be of a polynomial 
size into the condition that each element of a matrix representing a circuit be a polynomial 
time computable function and we may have more identical matrices in the circuit. So we 
arrived at a bunch of classes BQ t(jl ^ for t(n) being some function that tells us how many 
identical matrices in the circuit we have for an input of length n. This definition is very easy to 
work with. Generalizing this definition to the possibility of more accepting configurations, we 
arrived at the family MQ l ^ n \ The relation between the classes is that for each i: BQ l C MQ\ 
All these classes form a huge hierarchy and are all contained in the class AW PP. However, 
none of the inclusions is known to collapse, nor to be proper. Showing a proper inclusion 
would immediately lead to P ^ PSPACE, which is considered to be a very hard question. 

From above, it followed that BQP° l vW D BQP. Nevertheless, we showed that the two 
main quantum algorithms, Deutsch-Jozsza and Shor algorithm belong to the class BQ 2 and 
M Q 2 respectively, which raises the question whether the poly(n) in BQ poly ^ can be bounded 
for example to be a linear function or a constant, or at least to put BQP C MQ 1 ^ with 
t(n) being something smaller than a general polynomial. 

To come back to the classical world, we loosened the definitions of the classes MQ t(jl ^ and 
BQ t( - n ' even more and dropped the unitarity condition of the matrices to arrive at the class 
M 2 which is shown to be equal to the class AWPP defined by 10.. This class was invented 
as an artificial class meant to be used just for proving lowness theorem of BQP. Here we 
showed that its definition was surprisingly natural. 

The advantage of all these new classes is that they sit in the hierarchy between BQP, 
BPP and AWPP and are £2 definable with easy definitions resembling the definition of BPP 
and BQP, and so fill the gap between the wellknown classes BQP and AWPP in a hierarchy 
where proving many things is considered to be very hard, such as whether AWPP = BQP. 
The classes suggested here therefore leave place for future proofs which might lead to showing 
some properties of BQP itself. They may also help to understand better how the unitarity 
of the nature influences computation in general. 
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